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Preface 


This book introduces and develops the differential and integral calculus of func- 
tions of one variable. It is intended primarily as a textbook for a one-year course 
at the college or university level, but would also be suitable for an advanced 
placement course in secondary school. An alternative possibility would be to use 
only the first six chapters for an introductory course in calculus lasting one semester 
or term. 

The principal prerequisite for an intelligent reading of this book is a good 
background in secondary-school mathematics. Although the importance of set 
theory in the teaching of elementary mathematics may have been exaggerated, 
it is nonetheless a fact that most modern mathematics is done within the context 
of intuitive set theory, and we have used it informally. For example, a function 
is defined on page 13 as a set of ordered pairs. A short summary of some of the 
common terminology and notations of set theory is given in the third paragraph 
of Chapter 4 beginning on page 163. Mathematical induction is used occasionally 
in proofs, e.g., in the proof of the equation f. x” = nx"—! in (7.5) on page 61. 
However, mathematical induction is in no sense an essential prerequisite for using 
the book. No previous knowledge of trigonometry is actually necessary since the 
definitions and development of the trigonometric functions as functions of a real 
variable are self-contained. Nevertheless, an understanding of the elementary 
trigonometry of angles would certainly be helpful. 

As far as we know, there is no single, optimal approach to learning mathe- 
matics. One person learns best by slowly and carefully studying the details and 
logical structures, whereas another prefers to skim at first, work many problems, 
and return to the theory later—if at all. This book is designed to accommodate a 
variety of users. One result of this design has been the inclusion of the proofs of 
a number of theorems whose proofs would probably not be covered in a typical 
first-year course. They are included for completeness and also to satisfy the reader 
who, in a given instance, really wants to know WHY. However, to facilitate skip- 
ping, the proofs have been indented on the page and set in smaller type than that 
used in the main body of the text. 

In teaching the course out of which this book developed, the authors do not 
emphasize proofs and do not present many proofs in detail. However, it has been 
Our experience that students can absorb, and do appreciate, a clear and accurate 
statement of a mathematical proposition even when either no attempt has been 
made to supply the argument or only the intuitive idea behind the proof has been 
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presented. Some examples of specific theorems whose proofs we would normally 
omit in teaching a first-year course are: 


(7.5), page 127: The = form of L’H6pital’s Rule. 

(3.1), page 258: The Intermediate Value Theorem. 

(7.2), page 522: The fact that a power series can be differentiated term by term. 

(2.2), page 553: The detailed derivation of the integral formula for the arc 
length of a parametrized curve. 


In addition, the following are some sections of the book that we either skip entirely 
or mention only briefly in a beginning course: 


Section 7, Chapter 2: L’Hopital’s Rule. 
All of Chapter 3: Conic Sections. (Sufficient analytic geometry for 
specific examples in calculus is developed as needed.) 
Section 3, Chapter 5: Inverse Function Theorems. 
Section 5, Chapter 6: Algebraic and Transcendental Functions. 
Section 3, Chapter 8: Numerical Approximations (Continued). 
Section 6, Chapter 8: Integration of Discontinuous Functions. 


Section 6, Chapter 11: Hyperbolic Functions. 


It is our hope that the reader will also find this book useful as a reference 
work. In his subsequent mathematical training he may wish to return and study 
more deeply some of the details, which he omitted as a beginning student and 
which at a later time will appear more meaningful and relevant to his interests. 

The format should offer no problems. Definitions are set in boldface type, 
and theorems in italic. All theorems are numbered with two integers, such as (2.3). 
The more important ones in addition carry the label THEOREM, and the few 
that are sufficiently famous to have common names are labeled with them. Other 
propositions and formulas, which are technically also theorems but are not generally 
referred to as such, are simply numbered. The first integer indicates the section, 
and the second is a reference within the section. Thus (2.3) is the third theorem, 
proposition, or formula occurring in Section 2. The number of the chapter is not 
given, but a reference to (2.3) is usually accompanied by the number of the page on 
which it is found. If no page is given, then both the reference and the theorem 
referred to are in the same chapter. Finally, in individual sections a number, such 
as (6), may appear to the right of a given equation so that it may be referred to by 
that number later in the same section. 

We are deeply indebted to many people, and it is difficult to make adequate 
acknowledgments. First, we wish to thank those of our colleagues in the Dartmouth 
Mathematics Department who have given generously of their time with helpful 
advice and criticism. Among these, we are especially grateful to Professor Edward 
M. Brown for his interest and many valuable suggestions. In typing and preparing 
the manuscript we have had the cooperation from time to time of all the secretaries 
in the Department. In particular, our thanks are due to Mrs. Margaret P. Carter, 
Mrs. Helen Hanchett, Mrs. Nancy B. French, Miss Jane Overman, and Mrs. 
Linda P. Wieselquist. The students who have willingly learned their calculus from 
unfinished parts of the manuscript also deserve our thanks. We have been fortunate 
in having the assistance of Mr. Rudd A. Crawford, Jr., of the Concord-Carlisle 
School District and Harvard University, who prepared the answers to the problems. 
Finally, although Slesnick is still a bachelor, Crowell is not and owes an inestimable 
debt to his wife Marilyn for her patience and understanding throughout the writing 
of this book. 


September 1967 R.H.C. 
Hanover, N.H. W.E:S. 
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Functions, Limits, and 
Derivatives 


1. Real Numbers, Inequalities, Absolute Values. Calculus deals with 
numerical-valued quantities and, in the beginning, with quantities whose 
values are real numbers. Some understanding of the basic set R of all real 
numbers is therefore essential. A real number is one that can be written as a 
decimal: positive or negative or zero, terminating or nonterminating. 
Examples are 

l, —5, 0, 14, 

Sr oeGneG > 3 == 0.375, 

AND w.,,, 

—n7r = —3,141592..., 

176355.14233333.... 


The most familiar subset of R is the set Z of integers. These are the numbers 
peer ee OD UT C2, By cee. C4) 


Another subset is the set Q of all rational numbers. A real number + is 
rational if it can be expressed as the ratio of two integers, more precisely, if 


r = —, where mand 7 are integers and n # O. Since every integer m can be 
n 


written 77? it follows that every integer is also a rational number. A scheme, 


analogous to (1), which lists all the positive rational numbers is the follow- 
ing: 


~ 
ws 
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we 


we 
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Of course there are infinitely many repetitions in this presentation since, for 
example, ? = $ = 8 =.... An unsophisticated guess would be that all 
real numbers are rational. There are, however, many famous proofs that this 
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is not so. For example, a very simple and beautiful argument shows that »/2 
is not rational. (See Problem 13 at the end of this section.) It is not hard to 
prove that a real number is rational if and only if its decimal expansion 
beyond some digit consists of a finite sequence of digits repeated forever. 
Thus the numbers — 


ETES4921321 3213213273 . ore ver), 
1.500000000 . . . (forever) 


are rational, but 


0.101001000100001000001 . . . etc. 
is not. 

The fundamental algebraic operations on real numbers are addition and 
multiplication: For any two elements a and b in R, two elements a + 5 
and ab in R are uniquely determined. These elements, called the sum and 
product of a and 5b, respectively, are defined so that the following six facts 
are true: 


(Gj) ASSOCIATIVE LAWS 
a+(b+c)=(a+5))+¢, a(bc) = (ab)c. 
Gi) COMMUTATIVE LAWS 
a+t+b=b+a, apes bu; 
(ii) DISTRIBUTIVE LAW 
(a + b)c = ac + be. 


(iv) EXISTENCE OF IDENTITIES 
R contains two distinct elements 0 and 1 with the properties that 
0O+a=a and 1:a=a for everya in R. 


(v) EXISTENCE OF SUBTRACTION 
For every a in R, there is an element in R denoted by —a such that 
a+ (—a) = 0. [‘‘a — b” is an abbreviation of “a + (—5)).”] 
(vi) EXISTENCE OF DIVISION 
For every a # 0 in R, there is an element in R denoted by a~* or 


l 66 99 
— such that aa: = 1. ( ; is an abbreviation of “ab~.”) 
a 


Addition and multiplication are here introduced as binary operations. How- 
ever, as a result of the associative law of addition, a + b + c is defined to 
be the common value of (a+ 6) + canda-+ (6+ c). In a like manner 
we may define the triple product abc and, more generally, a, + -:: + a, 
and a,:::a,. Many theorems of algebra are consequences of the above six 
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facts, and we shall assume them without proof. They are, in fact, frequently 
taken as part of a set of axioms for R. 

Another essential property of the real numbers is that of order. We write 
a < bas an abbreviation of the statement that a is less than b. Presumably 
the reader, given two decimals, knows how to tell which one is the smaller. 
For him the following four facts simply recall the basic properties governing 
inequalities. On the other hand, they may also be taken as axioms for an 
abstractly defined relation between elements of R, which we choose to denote 
by =<. 


(vii) TRANSITIVE LAW 
Ifa < bandb <c, thena <c. 


(vii) LAW OF TRICHOTOMY 
For every real number a, one and only one of the following alter- 
natives holds: a = 0, ora < 0, or O < a. 


(ix) Ifa<b,thna+c<b-+e. 
(x) Ifa < band0 < c¢, thenac < oc. 


Note that each of the above statements except (vi) remains true when 
restricted to the set Z of integers. Moreover, all the statements are true for 
the set Q of rational numbers. Hence as a set of axioms for R, they fail to 
distinguish between two very different sets: R and its subset Q. Later in this 
section we shall add one more item to the list, which will complete the alge- 
braic description of R. 

A real number a is positive if 0 < a and negative if a < 0. Since the 
relation “‘greater than”’ is just as useful as “‘less than,” we adopt a symbol for 
it, too, and abbreviate the statement that a is greater than b by writing a > 5b. 
Clearly a > 6 if and only if b < a. Statement (x), when translated into 
English, says that the direction of an inequality is preserved if both sides are 
multiplied by the same positive number. Just the opposite happens if the 
number is negative: The inequality is reversed. That 1s, 


(14) da = page «=< 0. then.ac > be. 


Proof. Since c < 0, statements (iv), (v), and (ix) imply 
0=c+ (-c) <0+ (-c) = —c. 
So —c is positive. Hence by (x), we get —ac < —bc. By (ix) again, 
—ac + (bc + ac) < —be + (bc + ac). 


Hence be < ac, and this is equivalent to ac > be. 
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Two more abbreviations complete the mathematician’s array of symbols 
for writing inequalities: 


a= b means .a-<) ofa = 5, 
a> b means: qh eg bp, 


The geometric interpretation of the set R of all real numbers as a straight 
line is familiar to anyone who has ever used a ruler, and it is essential to an 
understanding of calculus. To describe the assignment of points to numbers, 
consider an arbitrary straight line L, and choose on it two distinct points, one 
of which we assign to, or identify with, the number 0, and the other to the 
number 1 (see Figure 1). The rest is automatic. The scale on L is chosen 


Smear eee ree Sy 8 Se ee eee 
0 l L 


Figure | 


so that the unit of distance is the length of the line segment between the points 
0 and 1. Every positive number a is assigned the point on the side of 0 con- 
taining 1 which is a units of distance from 0. Every negative number a is 
assigned the point on the side of 0 not containing 1 which is —a units of 
distance from 0. Note that if LZ is oriented so that 1 lies to the right of 0, 
then for any two numbers a and b (positive, negative, or zero), a < b if and 
only if a lies to the left of b. A line which has been identified with R under a 
correspondence such as the one just described is called a real number line 
(see Figure 2). 


J2 tr 

SSS SS 

—3 —2 =i 0 ] 2 3 4 
Figure 2 


An interval is a subset J of R with the property that whenever a and c 
belong to Janda < b < c, then balso belongs to 7. Geometrically an interval 
is a connected piece of a real number line. A number d is called a lower 
bound of a set S of real numbers if d < s for every sin S. It is an upper 
bound of S if s < d for every sin S. A given subset of R, and in particular 
an interval, is called bounded if it has both an upper and lower bound. There 
are four different kinds of bounded intervals: 


(a, b), the set of all numbers x such that a < x < BD; 
[a, b], the set of all numbers x such thata < x < b; 
[a, b), the set of all numbers x such thata < x < BD; 


(a, b], the set of all numbers x such thata < x < Bb. 
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In each case the numbers a and b are called the endpoints of the interval. The 
set [a, b] contains both its endpoints, whereas (a, b) contains neither one. 
Clearly [a, b) contains its left endpoint but not its right one, and an analogous 
remark holds for (a, 6]. It 1s important to realize that there is no element 
oo (infinity) in the set R. Nevertheless, the symbols « and —o are com- 
monly used in denoting unbounded intervals. Thus 


(a, oo) is the set of all numbers x such thata < x, 
[a, 00) is the set of all numbers x such that a < x, 
(—o,a) is the set of all numbers x such that x < a, 
(—«,a] is the set of all numbers x such that x < a, 
(— 00, 0) is the entire set R. 


The symbols « and —« also appear frequently in inequalities although 
they are really unnecessary, because, for example, 


—o <x <a 1s equivalent to x < a, 
a<x < o is equivalent toa < x, 


etc. Since o is not an element of R, we shall never use the notations [a, o ], 
x < ow, etc. An unbounded interval has either one endpoint or none; in 
each of the above cases it is the number a. We call an interval open if it con- 
closed 

neither 


open 


closed 


open 


=o =2 = are 2 3 + 


Figure 3 


tains none of its endpoints, and closed if it contains them all. Thus, for 
example, (a, b) and (— «, a) are open, but [a, b] and [a, «) are closed. The 
intervals [a, 6) and (a, b] are neither open nor closed, although they are 
sometimes called half-open or half-closed. Since (— «, «) has no endpoints, 
it vacuously both does and does not contain them. Hence (— », «) has the 
dubious distinction of being both open and closed. 
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EXAMPLE |. Draw the intervals [0, 1], [—1, 4), (2, 0), (—#, —1], (-—1, 3), 
and identify them as open, closed, neither, or both (see Figure 3). 


It is frequently necessary to talk about the size of a real number without 
regard to its sign, not caring whether it is positive or negative. This happens 
often enough to warrant a definition and special notation: The absolute value 
of a real number a is denoted by |a| and defined by 


a ne 9, 
lal = 
—a iia <0. 


Thus |3| = 3, |0| = 0, |—3| = 3. Obviously, the absolute value of a real 
number cannot be negative. Geometrically, |a| is the distance between the 
points 0 and a on the real number line. A generalization that is of extreme 
importance is the fact that |a — b| is the distance between the points a and b 
on the real number line for any two numbers a and b whatsoever. Probably the 
best way to convince oneself that this is true is to look at a few illustrations 
(see Figure 4). 


—+—_@—_+__@——+-_ 3 —1)=2=]1-3], 


0 1 7 3 4 
—@——_}+—__+—__+—@-  |-1-3|=4, 
=a 0 1 Z 3 


———_$+—__}_+—_@ ((-3)-1|=]1-(-3)|=4. 
—3 —2 “a f 0 ] 


Figure 4 


EXAMPLE 2. Describe the set J of all real numbers x such that |x — 5| < 3. 
For any number x, the number |x — 5] is the distance between x and 5 ona 
real number line (see Figure 5). That distance will be less than 3 if and only 


poet ool 
pt} tp ttt tt tt pt 


—1 0 1 rs BL 4 5 6 7 8 9 


Figure 5 


if x satisfies the inequalities 2 << x < 8. We conclude that J is the open 
interval (2, 8). 

There is an alternative way of writing the definition of the absolute 
value of a number a which requires only one equation: We do not have to give 
separate definitions for positive and negative a. This definition uses a square 
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root, and before proceeding to it, we call attention to the following mathe- 
matical custom: Although every positive real number a has two square roots, 
in this book the expression \/a always denotes the positive root. Thus the two 
solutions of the equation x” = 5 are \/5 and —\/5. Note that the two equa- 
tions 

bee and x= Va 

are not equivalent. The second implies the first, but not conversely. On the 
other hand, 

=a and Ix] = Va 


are equivalent. Having made these remarks, we observe that 


(1.2) la| = Va. 


The formulation (1.2) is a handy one for establishing two of the basic 
properties of absolute value. They are 


(1.3) Jab] = [al |B]. 
(1.4) la 8] <.la| +10), 


Proofs. Since (ab)? = ab? and since the positive square root of a product of two 
positive numbers is the product of their positive square roots, we get 


lab] = V/ (ab)? = V/a?b? = V/a2V/b? = [al 2}. 
To prove (1.4), we observe, first of all, that ab < |ab|. Hence 
a® + 2ab + b? < a? + 2\ab| + b? = Jal? + 2Ial |b) + [d)?. 
Thus, 
Borne =a by < {alt |b). 


By taking the positive square root of each side of the inequality (see 
Problem 9), we get (1.4). 


As remarked above, our list of basic facts (1) through (x) about the set R 
of real numbers is incomplete. One important property of real numbers that 
together with the others gives a complete characterization is the following: 


(xi) LEAST UPPER BOUND PROPERTY 
Every nonempty subset of R which has an upper bound has a least 
upper bound. 


Suppose S is a nonempty subset of R which has an upper bound. What 
(xi) says is that there is some number b which (1) is an upper bound, i.e., 
s < b for every sin S, and (2) if cis any other upper bound of S, then b < c. 
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It is hard to see at first how such a statement can be so significant. Intuitively 
it says nothing more than this: If you cannot go on forever, you have to stop 
somewhere. Note, however, that the rational numbers do not have this prop- 
erty. The set of all rational numbers less than the irrational number 1/2 
certainly has an upper bound. In fact, each of the numbers 2, 1.5, 1.42, 1.415, 
1.4143, and 1.41422 is an upper bound. However, for every rational upper 
bound, there will always exist a smaller one. Hence there is no rational least 
upper bound. 


PROBLEMS 


1. Draw the following intervals and identify them as bounded or unbounded, 
closed or open, or neither: (2, 4), [3,5], (—%, —2], [1.5, 2.5), (2, 7). 

2. Draw each of the following subsets of R. For those that are given in terms of 
absolute values write an alternative description that does not use the absolute 
value. 


(a) Set of all x-such that-4 < x. <= 43. 

(b) Set of all x such thatO < x < 0%. 

(c) Set-of all x such thats <= x=. 

(d) Set of all x such that |x| > 2. 

(e) Set of all y such that 1 < |y| < 3. 

(f) Set of all z such that |z — 2| < 1. 

(g) Set of all x such that |x — al > 0. 

(h) Set of all w such that 1 < |u — 1| < 5. 


3. Prove the following facts about inequalities. [Hint: Use (villi), (ix), (x), (1.1), 
and the meanings of > and <. In each problem you will have to consider 
several cases separately, e.g.,a > Oanda = 0.] 


(a) Ifa < b,thena+c< b+. 

(b) Ifa > 6b, thena+c>b+c. 

(c)°lf @ < ‘b and 'c’>-0,; then ae = de. 

(id) a < band c <0, thende > we, 
4. Prove that a is positive (negative) if and only if ; is positive (negative). 
St Oe ee, prove that - =. 


ola > cand b < 0, prove that = a 
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7. Ik a.< b < .c, prove that 


=~ 


Cy. | Oe 


Ha > 0, 


> Boers). 


Oop Ss 
Q1cm 


8. Does the set Z of integers have the Least Upper Bound Property? That is, if a 
nonempty subset of Z has an upper bound, does it have a smallest one? 
9. Show that if 0 < a < b, then0 < Va < Vo. 

10. Prove that a = bif and only ifa < bandd < a. 

11. Show that the Least Upper Bound Property implies the Greatest Lower Bound 
Property. That is, using (xi), prove that if a nonempty subset of R has a lower 
bound, then it has a greatest lower bound. 

12. Verify the assertion made in the text that if an interval is bounded it must be 
one of four types: (a, 5), [a, b], (a, 6], or [a, b). (Hint: See Problem 11.) 

13. Prove that \/2 is irrational. (Hint: The proof, which is elegant and famous, 


starts by assuming that 1/2 = ; , where p and gq are integers not both even. 


A contradiction can then be derived.) 


2. Ordered Pairs of Real Numbers, the xy-Plane, Functions. The set whose 
members consist of just the two elements a and b is denoted {a,b}. The 
notation is not perfect because it suggests that the members a and b have been 
ordered: a is written first and b second. Actually no ordering is present 
because {a, b} = {b, a}. Note also that ifa = b, then {a, b} = {b,a} = {a}. 
It can happen, however, that the ingredient of order is essential. We therefore 
introduce the notion of an ordered pair (a, b) whose first member is a and 
whose second member is b. The characteristic property of ordered pairs is 


(2, BY = (c,d) if and only if a = c and b = d. 


In particular (a, b) = (6, a) if and only if a = b. In Section 1 we saw that 
the set R of all real numbers can be thought of as a straight line. We shall 
now show that every ordered pair (a, b) of real numbers a and b can be 
identified with a point in a plane. This brings up a notational problem: Is 
(5, 7) the ordered pair of real numbers or is it the open interval consisting of 
all x such that 5 < x < 7? The answer is that it is impossible to tell out of 
context—just as it is impossible to tell whether the word “‘well’”’ is the noun 
or the adverb. 

Consider two distinct real number lines drawn in a plane so that they 
intersect at the number O on each line. One of the lines is traditionally drawn 
horizontal and called the x-axis, and the other is made perpendicular to it 
and called the y-axis. The orientation is chosen so that the number 1 on the 
x-axis lies to the right of 0, and the number 1 on the y-axis is above 0. It is 
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also customary to use the same scale of distances on both axes. For every 
ordered pair (a, b) of real numbers, let L, be the line parallel to the y-axis 
that cuts the x-axis at a, and let M; be the line parallel to the x-axis that cuts 
the y-axis at b. We assign the point of intersection of L, and M,j to the ordered 
pair (a, b) (see Figure 6). The numbers a and 5b are called the coordinates of 


Figure 6 


the point. a is the x-coordinate (or abscissa) and b is the y-coordinate (or 
ordinate). 

If the pairs (a, b) and (c, d) are not equal, then the points in the plane 
assigned to them will be different. In addition, every point in the plane has a 
number pair assigned to it: Starting with a point, draw the two lines through 


Figure 7 


it which are parallel to the x-axis and the y-axis. One line cuts the x-axis at a 
number a, and the other cuts the y-axis at b. The ordered pair (a, 5) has the 
original point assigned to it. It follows that our assignment 


pair — point 
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is a One-to-one correspondence between the set of all ordered pairs of real 
numbers, which we denote by R’, and the set of all points of the plane. It is 
convenient simply to identify R® with the plane together with the two axes. 


EXAMPLE |. Plot the points (1, 2), (—2,3), (0,1), (4,0), (—2, —3), and 
(2, —3) on the xy-plane (see Figure 7). 


The usefulness of the idea of an ordered pair is by no means limited to 
pairs of real numbers. In plane geometry, for example, we may consider the 
set of all ordered pairs (T, p) in which T is a triangle and p is the point of 
intersection of its medians. In the three-dimensional extension of the 
xy-plane, the set R® of all ordered triples (a, b, c) of real numbers is identified 
with the set of all points in three-dimensional space. The definition of an 
ordered triple can be reduced to that of an ordered pair by defining (a, b, c) 
to be ((a, b), c). 


Figure 8 
Let P = (a, b) and Q = (c, d) be arbitrary elements in the set R? of all 


ordered pairs of real numbers. We define the distance between P and Q by 
the formula | 


ae 


Three simple corollaries of this definition are: 


(2.1) Distance (P, Q) > 0; 1.e., distance is never negative. 


12 FUNCTIONS, LIMITS, AND DERIVATIVES  [cuap. 1 


(2.2) Distance (P, Q) = Oif and only if P = Q. 
(2.3) Distance (P, Q) = distance (Q, P). 


Another consequence of (1) is that it is no longer simply a matter of 
tradition and convenience that we draw the y-axis perpendicular to the 
x-axis. It follows from consideration of the Pythagorean Theorem and its 
converse (see Figure 8) that the above definition of distance between elements 
of R’ corresponds with our geometric notion of the distance between points 
in the Euclidean plane if and only if the two coordinate axes are perpendicular 
and the scales are the same on both. 


EXAMPLE 2. Let C be the subset of the xy-plane consisting of all points whose 
distance from (1, 1) is equal to 2. Thus C is the circle shown in Figure 9. 


| y-axis 


Figure 9 


If (x, y) is an arbitrary point in the xy-plane, its distance from (1, 1) is equal 


tov/( — 1)? + & — 1). Hence,{ 4.9) belongs to C if and on 


V(x — 12+ — 1)? = 2. (2) 
Numbers x and y satisfy (2) if and only if they satisfy 
(ey ay (3) 


Thus C is the set of all ordered pairs (x, y) that satisfy (3)—or that satisfy (2). 
Either (2) or (3) is therefore called an equation of the circle C. 


The set of all points (x, y) in the plane that satisfy a given equation is 
called the graph of the equation. Hence, in the above example, the circle C 
is the graph of the equation (x — 1)? + (y — 1)? = 4. 
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EXAMPLE 3. Let L be the set of all ordered pairs (x, y) such that y = 2x — 3. 
For each real number x, there is one and only one number y such that (x, y) 
belongs to L: y = 2x — 3. To see what L looks like, we plot five of its points 
(see Table 1). As shown in Figure 10, all these points lie on a straight line. 


In Section 5 we shall justify the natural conjecture that this straight line is the 
set L. 


TABLE 1 


x-axis 


Figure 10 


EXAMPLE 4. The set of all pairs (x, y) such that y? = x is the curve shown in 
Figure 11. This curve is a parabola, one of the conic sections, which are 
studied in greater detail in Chapter 3. At present we shall be satisfied with 
plotting a few points and connecting them with a smooth curve (see Table 2). 


TABLE 2 


Figure 11 


A function f is any set f of ordered pairs such that whenever (a, 6) and 
(a, c) belong to f, then b = c. Note that every subset of the xy-plane is a set 
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of ordered pairs, but not every subset is a function. In particular, the parab- 
ola in Example 4 is not, because it contains both (4, 2) and (4, —2). On the 
other hand, the straight line in Example 3 is a function. This condition that a 
function must never contain two pairs (a, b) and (a,c) with b ¥ c means 
geometrically that a subset of the xy-plane is a function if and only if it 
never intersects a line parallel to the y-axis in more than one point. Hence it 
is an easy matter to decide which of the following sets are functions and which 
are not: 


(i) The set f of all pairs (x, y) such that y = x + -1. 
(ii) The set g of all pairs (x, y) such that x? + y? = 1. 
(iii) The set F of all pairs (x, y) such that y = x? + 2x + 2. 
(iv) The set’ of all pairs (x, y) such that 2x + 3y = 1. 
(v) The set G of all pairs (x, y) such that y = \/x + 2. 
(vi) The set H of all pairs (x, y) such that y* = x. 


The sets f, F, h, and G are functions, but g and H are not. 

The domain of a function fis the set of all elements a for which there is a 
corresponding b such that (a, b) belongs to f. Analogously, the range of f 
is the set of all elements b for which there is an a such that (a, b) belongs to /. 
In (i), the domain of fis the set R of all real numbers and so is the range. On 
the other hand, in (iii), although the domain of Fis equal to R, the range is the 
interval consisting of all real numbers y > 1, because we can write x° + 
2xt+2=(x+ 174+ 121. 

If a pair (a, b) belongs to a function f, we call b the value of f at a and 
write b = f(a). Note that the meaning of f(a) is unambiguous only because 
the definition of a function forbids having (a, b) and (a, c) both belong to f 
if b ¥ c. Therefore the second member of any ordered pair that belongs to f 
is determined by the first member. 


EXAMPLE 5. In (i), 
fij= 2 dM, f@=a-+l, 
fo = iGy = C+ 4+) Sa 
f(-D=-14+1=0, f@tb=a+b4+1. 
In (v), 
Gaj=VEt2, GOR) = van pe 
G0) = v2, G(—2) = 9, 
GO)2= 2; G(—3) is not defined. 
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To each element a in the domain of a function f there corresponds a 
value f(a) in the range. This correspondence between domain and range, 
which is pictured in Figure 12, is the central idea in the definition of a function. 
Thus the function f that consists of all ordered pairs (x, y) such that y = x? 
and —1 < x < 21s interpreted as the rule of correspondence which assigns 
to each number in the interval [— 1, 2]its square. We can describe f completely 
and simply by writing 


(iC) ae ae ol =< x < 2, 


domain f 
Figure 12 
Examples of other functions are 
g(x) = Vx +1, oe 
F(x) = x’, —0O <x <.m, 
x 
kat x 2, 


Note that the functions f and F immediately above are not equal, although f 
is a subset of F. Two functions are equal if they are one and the same set of 
ordered pairs. It follows that 


(2.4) Functions f and g are equal if and only if they have the same domain D 
and f(x) = g(x) for every element x in D. 


Thus any complete description of a function must include a description 
of its domain. Sometimes this information is in fact omitted. We shall adopt 
the convention that if no explicit description of the domain of a function 1s 
given, then its domain is assumed to be the largest set of real numbers that 
makes sense. For example, the domain of the function H defined by 


Il l 
MO) = = x= 2" @EDE-D 


is assumed to be the entire set of real numbers with the exception of — 1 and 2. 
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It is sometimes helpful to think of a function as a computing machine. 
Imagine a computing machine, named f, which is provided with an input 
tape, an output tape, and a button (see Figure 13). One writes a number x 
on the input tape and pushes the button. If x is one of the inputs which the 
machine will accept, i.e., if x is in the domain of f, the machine whirs con- 
tentedly and prints an output, which we denote f(x), on the output tape. If x 
is not in the prescribed domain, either nothing happens or a red light flashes. 


button 


Figure 13 


We have already seen that one of the best ways of describing a subset of 
R? is to draw a picture of it. If this subset happens to be a function, we call 
the picture the graph of the function. More specifically, if a function f is a 
subset of R, its graph is the set of all points in the plane that correspond to 
ordered pairs of the form (x, f(x)). Note that the graph of f depends on the 
correspondence between ordered pairs and points; i.e., it depends on the 
choice of axes. To illustrate this, in Figure 14 we have drawn the graph of 


Figure 14 


the function f defined by f(x) = x° for two sets of axes. Fora single choice of 
axes, we simply identify ordered pairs and points, and under this identification 
a function and its graph become the same thing. 

Most of the functions encountered in an introduction to calculus are 
defined by means of a single equation; e.g., h(x) = x* + 3. It is a bad mis- 
take, however, to assume that this is always true. The function F given by 


ae | ifx > 0, 
F(x) = x 


Se ix =<). 
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requires two equations for its definition. The graph of F is shown in Figure 15. 
Another function, which is so wild that it is impossible to draw its graph, is 
the following: 


0 if x is rational, 
g(x) = aS 

l if x is irrational. 
The ordered pairs that comprise a function are not necessarily pairs of 
numbers. An example is the function, mentioned earlier in this section, which 
assigns to each triangle the point of intersection of its medians. It is possible 
for the domain of a function to be a set of ordered pairs. Consider the 
function f consisting of all ordered pairs ((x, y),z), where x, y, and z are 


y-axis 


Figure 15 


numbers that satisfy x > y and z = 2x? + y®. We describe this function 
simply as follows: 


oye 2x ty, Ox p. (4) 


As a final example of a function, consider the rule of correspondence that 
assigns to each person his or her male parent. 

As we have indicated, the definition of a function is appallingly general. 
One of our tasks is to delineate properly the kinds of functions studied in 
calculus. To begin with, a function f is said to be real-valued if its range is a 
subset of R, the set of real numbers. If the domain of fis a subset of R, we 
call fa function of a real variable. The function f(x, y) defined in (4) has as its 
domain a subset of R’. It is a real-valued function of two real variables. For 
the most part, a first course in calculus is a study of real-valued functions of 
one real variable. 


PROBLEMS 


1. Plot the following points in the xy-plane: (0, —2), (1, 3), (3,1), (—4, —4), 
and (5, 0). 

2. In the xy-plane plot the points (1, 2) and (2, 1), (—3, 2) and (2, —3), (—2, —3) 
and (—3, —2). Describe the relative positions of the points (a, 6) and (6, a) 
for arbitrary a and 5. 
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3. The x-axis and the y-axis divide R? into four quadrants, as shown in Figure 16. 
Let (a, b) be a point for which neither a nor 6 is zero. How can you recognize 
instantly which quadrant (a, b) belongs to? 


y-axis 


Figure 16 


4. Find the distance between 
7 2) and G; 4), 
(2, 3) and (3, 2), 
(3, 4) and (—1, 2), 
(—2,-—1) and (2,1). 

In each case plot the points in R?. 
5. Verify Proposition (2.4). 
. Plot the subsets of the xy-plane defined in (i) through (vi), page 14. 


7. In each of the following, plot the subset of R? that consists of all pairs (x, y) 
such that the given equation (or conditions) is satisfied. 


ON 


fay Ie fy = 2 (h) 4x? — y? = 

(b) x+y=1 oe ye 4+ x2 

(c) y = |x| (j) y = [x3 

(d) y= vx (k) y = largest integer less than 
ey OUP or equal to x 

() xt + 4y? = ages iga  © is: 
(gy a ye ee a 


oe) 


. In Problem 7, which subsets are functions? 
9. Let fand g be two functions defined, respectively, by 
fe) =x? +x41, —2 ix < w, 


Ah 


7k for every real number x except x = 1. 
X.. — 


g(x) = 


Find: 
(a) f(2), f(0), f(a), fla + 5), fla — 4). 
(b) g(0), g(—1J), g(10), g(5 + 2), g(x’). 
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10. Give an example of a function fand a function g that satisfy each of the follow- 
ing conditions. 


(a) domain f = domain g, but range f ¥ range g. 

(b) domain f ~ domain g, but range f = range g. 

(c) domain f = domain g and range f = range g, but f # g. 

(d) f(@ = g(a) for every a that belongs to both domains, but SF g. 


11. What is the assumed domain of each of the following functions? 


ee eine <4 Gas oes at 
(oy fx) of 2? (f) F(x) = V2 — Bx — 20 
ae (g) The set of all ordered pairs (x, y) 
eee Fe such that 
(c) g(x) ae eee 
eee 
(dy. fx) = $2 ones * 


3. Operations with Functions. If f and g are two functions, a new function 
f(g), called the composition of g with /, is defined by 


(f(g))() = f(g(a). 


For example, if f(x) = x? — 1 and g(x) = = es , then 
(7(g))(x) = f(g(x)) = (g(x)? -— 1 
fics Iitetpa 12Gn'-bl). 
do Beate Be 


The composition of two functions is the function obtained by applying one 
after the other. If f and g are regarded as computing machines, then f(g) is 
the composite machine constructed by feeding the output of g into the input 
of f as indicated in Figure 17. In general it is not true that f(g) = g(f). In 


Rea ee = 
| | 
8 (a) | 
| 
ee 3 ; 


Figure 17 
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the above example we have 


(e(N)G) = s(feo) = FA 


JAX) rreck 
ee ae ae 
a a fg a = 


and the two functions defined by (1) and (2) are certainly not the same. In 
terms of ordered pairs the composition f(g) of g with f is formally defined to 
be the set of all ordered pairs (a, c) for which there is an element 6 such that 
b = g(a) andc = f(b). 

If f and g are two real-valued functions, we can perform the usual arith- 
metic operations of addition, subtraction, multiplication, and division. Thus 


] 
for the functions f(x) = x* — 1 and g(x) = a , we have 


x-—l1 
f(x) + ge) = x°- 1 +245. 
f(@) — gx) = x° 1-255. 
fee) = @? -— NES 
= (x +x4+ 1x4 1) ifx ~ l, 
ame | 


Just as with the composition of two functions, each arithmetic operation 
provides a method of constructing a new function from the two given func- 
tions f and g. The natural notations for these new functions are f+ g, 


f -— g, fg, and I They are defined by the formulas 
& 


(f + g)(x) = f(x) + g(x), 
(f — g(x) = f(x) — g(x), 
(fex) = f(x)g(x), 


(ore. 
a, = $6) if g(x) # 0. 


The product function fg should not be confused with the composite function 
f(g). For example, if f(x) = x° and g(x) = x°, then we have 
(fg)(x) = f(x)g(x) = x°+ x° = x", whereas 


(f(g) () = f(g) = GP = x”. 
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We may also form the product af of an arbitrary real number a and real- 
valued function f. The product function is defined by 


(af)(x) = af(x). 


EXAMPLE |. Let functions f and g be defined by f(x) = x — 2 and g(x) = 
x* — 5x + 6. Dtaw ame graphs of f, ¢, 2f, and f+. We compute the 
function values corresponding to several different numbers x in Tables 3 and 
4. The resulting graphs of f and g are, respectively, the straight line and 


(a) Figure 18 (b) 


parabola shown in Figure 18(a). It turns out that the graphs of 2f and 
f+ gare also a straight line and a parabola. They are drawn in Figure 18(b). 
To see why the graph of f+ gis a parabola, observe that 


(f+ sx) =f) + 8) = — 24+ (2? - 5x4+ 6 =x -— 4x44 
= (x = 2)*. 


It follows that f + g is very much like the function defined ye pion: 
Instead of simply squaring a number, f + g first subtracts 2 and then squares. 
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Its graph will be just like that of y = x? except that it will be shifted two units 
to the right. 


TABLE 3 TABLE 4 

x | f(x) | 2f(x) x | g(x) 
0O| —2 —4 0 6 
1| -1 —2 5 6 
2 0 0 =| —Z 
3 1 2 l Z 
4 2 


Up to this point we have used the letters f, g, h, F, G, and H to denote 
functions, and the letters x, y, a, b, and c to denote elements of sets—usually 
real numbers. However, the letters in the second set are sometimes also used 
as functions. This occurs, for example, when we speak of x as a real variable. 
As such, it not only is the name of a real number but also can take on many 


different values: 5, or —7, or 7, or .... Thus the variable x is a function. 
Specifically, it is the very simple function that assigns the value 5 to the 
number 5, the value —7 to the number —7, the value 7 toz,.... For every 


real number a, we have 


xta) = a. 


This function is called the identity function. 

Suppose, for example, that s is used to denote the distance that a stone 
falling freely in space has fallen. The value of s increases as the stone falls 
and depends on the length of time ¢ that it has fallen according to the equa- 
tion s = 4g??, where g is the constant gravitational acceleration. (This 
formula assumes no air resistance, that the stone was at rest at time ¢ = 0, 
and that distance is measured from the starting point.) Thus s has the value 
2¢ if t has the value 3, and, more generally, the value $ga” when ¢ has the 
value a. If we consider ¢ to be another name for the identity function, then s 
may be regarded as the function whose value is 


s(a) = 3ga” = 4¢(t(a))’ 


for every real number a. The original equation s = 3g?’ then states the rela- 
tion between the two functions s and ¢. The fact that s and ¢ take on different 
values is also expressed by referring to them as variables. A variable is simply 
a name of a function. In our example s is called a dependent variable, and 
t an independent variable, because the values of s depend on those of f 
according to s = 4gt*. Thus an independent variable is a name for the identity 
function, and a dependent variable is one that is not independent. 
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A real variable is therefore a name of a real-valued function. Since the 
arithmetic operations of addition, subtraction, multiplication, and division 
have been defined for real-valued functions, they are automatically defined 
for real variables. 

We shall generally use the letter x to denote an independent variable. 
This raises the question: How does one tell whether an occurrence of x 
denotes a real number or the identity function? The answer is that the 
notation alone does not tell, but the context and the reader’s understanding 
should. However, a more practical reply is that it doesn’t really make 
much difference. We may regard f(x) as either the value of the function 
f at the number x or as the composition of f with the variable x. If x is an 
independent variable, the function f(x) is then the same thing as f. 


EXAMPLE 2. The conventions that we have adopted concerning the use of 
variables give our notations a flexibility that is both consistent and ex- 
tremely useful. Consider, for example, the equation 


p= iy — 3x. 


On the one hand, we may consider the subset of R’, pictured in Figure 19, 
that consists of all ordered pairs (x, y) such that y = 2x? — 3x. This subset 


y-axis 


Figure 19 


is a function f whose value at an arbitrary real number x is the real number 
f(x) = 2x? — 3x. Alternatively, we may regard x as an independent 
variable, i.e., the identity function. The composition of f with x is then the 
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function f(x) = 2x? — 3x, whose value at 2, for instance, is 


(f(x))Q) = f (xQ)) = fQ) = 8 — 6 = 2. 


A third interpretation is that y is a dependent variable that depends on x 
according to the equation y = 3x? — 2x. That is, y is the name of the 
function 3x? — 2x. 


EXAMPLE 3. Let F be the function defined by F(x)=x°>+x+1. If 
u= /x — 2, then 


Fu) =wv+u+1 
= (x — 2 4 (— P+ 1. 
If we denote the function F(x) by w, then 
utw=vVvx—24+>x 4x41, 
uw = (x — 2)7?O° + x + 1). 


On the other hand, we may let G be the function defined by G(x) = Vx — 2 
for every real number x > 2. Then G + Fand GF are the functions defined, 
respectively, by 


(G + F)(Xx) 


GF F(x) 
=x —247 +28 
(GF)(x) = G(x)F(x) = (x — 2)" + x + 1). 


To say that a is a real constant means first that it is a real number. 
Second, it may or may not matter which real number a is, but it is fixed for the 
duration of the discussion in which it occurs. Similarly, a constant function 


y-axis 


Figure 20 
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is one which takes on just one value; i.e., its range consists of a single ele- 
ment. For example, consider the constant function f defined by 


Sieh =. 3; it Gk Gases 


The graph of fis the straight line parallel to the x-axis that intersects the y-axis 
in the point (0, 5); see Figure 20. We shall commonly use lower-case letters 
at the beginning of the alphabet, e.g., a, b, c,..., to denote both constants 
and constant functions. 


EXAMPLE 4. Consider the function ax + b, where a and 5b are constants, 
a ~ 0, and x is an independent variable. The graph of this function is a 


Sere oe . b 
straight line that cuts the y-axis at b and the x-axis at — —. It is drawn in 
a 


Figure 21. This function is the sum of the constant function b and the func- 
tion which is the product of the constant function a and the identity function x. 


Figure 21 


PROBLEMS 
1. Let functions fand g be defined by 
FOS ea SO Sa Bh eyS 
Find A(x) if 


(ay ee = fle) iG) k= fg 
ts +e ce) b= se. 
icy en 2) 


2. What is the domain and range of each of the functions f and g in Problem 1? 
What is the domain of each of the functions h? 


3. If f(x) = x + 1 and g(x) = x — 1, plot the graph of the Gaston 
4 
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at. 


£2. 


ee 
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. Plot the graph of the composite function F(g), where F and g are the functions 


defined by g(x) = x — 2 and F(x) = in 
x 


_ If f, g, and / are functions, show that f(g(4)) = (f(g))(A). This is the Asso- 


ciative Law for the Composition of Functions. 


. If fis a real-valued function, how would you define the function 3f? How would 


you define Vf ? 


. The velocity v of a freely falling body depends on the distance s that it has 


fallen according to the equation v = \/2gs, where g is the constant gravita- 

tional acceleration. 

(a) Using an s-axis and a v-axis, plot the dependent variable v as a function 
of the independent variable s. 

(b) Ifs depends on the time ¢ according to the equation s = 5gt?, how does v 
depend on ¢t? 

Note that the variable v in (a), which depends on 5s, is not the same function as 

the variable v in (b), which depends on t. Without knowing which is referred 

to, the meaning of the value of v at 2 is ambiguous. 


I{f¥w=wtutiwu = x?+ 2, andv = x — 1, what is the value of each 


of the following functions at an arbitrary real number x? 


(a) u+tuv (b) w+o (c) wu. 


A rey a ee 
x 


(a) (F(u))(x) = 
(b) F(w()) = 
(Cc) Ut wy) = 


The equation y = 2x + 1 defines y as a function of x. It also defines x as a 
function of y. Describe the latter function in two ways. 

Draw the graph of the function f(x) = ax — 1 for four different values of the 
constant a. 

If fand g are two real-valued functions, give the definitions of the sum f+ g 
and the product fg in terms of ordered pairs. 

Let fand g be two real-valued functions. In terms of domain f and domain g, 
what are: 


(a) domain f(g)? (b) domain (f + g)? (c) domain fg? 


4. Limits and Continuity. Consider the function f defined by 


Kod: Gee 


ee x 2.2, 


f(x) = 


The domain of f is the set of all real numbers with the exception of the 
number 2, which has been excluded because substitution of x = 2 in the 
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expression for f(x) yields the undefined term 8. On the other hand, 
x? — 3x + 2 = (x — 1)(* — 2) and 


ove? =x 1, provided x # 2. (1) 
The proviso is essential. Without it, (1) is false because, if x = 2, the left 
side is undefined and the right side is equal to 1. We therefore obtain 


etx) x = 1, x € 2. 


The graph of the function x — 1 is a straight line L; so the graph of f is the 
punctured line obtained from L by omitting the one point (2,1) (see 
Figure 22). 


Figure 22 


Although the function f is not defined at x = 2, we know its behavior 
for values of x near 2. The graph makes it clear that if x is close to 2, then 
f(x) is close to 1. In fact, the values f(x) can be brought arbitrarily close to 1 
by taking x sufficiently close to 2. We express this fact by writing 


ee 
xr— 2 x — Z : 
2 — 3 
which is translated: The limit of e = is 1 as x approaches 2. 
Xe — 
x— V3 ; x—-vV3. 
EXAMPLE 1. Evaluate lim ene . The function vas is not de- 
ty) sie Be, 


fined at x = 3. The following algebraic manipulation puts the function in a 
form in which its behavior close to 3 can be read off easily: 


ee xe 

x-—3 ae a ad x-3Vx4tv3 
: 1 
BLY a OATS 


) ux oe 5. 
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Again note the proviso x ~ 3: When x = 3, the last quantity in the pre- 


l 
ceding equations is equal to —=———_ , but the first quantity is not defined. 
& eq q eee q y 
However, by taking values of x close to 3, it is clear that the corresponding 
l | 

values of EE RYE can be brought as close as we please to AE We 
conclude that 

Axes tN 

lim = 

r38 x — 3 2/3 

x- V3 l 
In words: The limit of vane , aS x approaches 3, is WE) ; 


| 
EXAMPLE 2. If f(x) = —, evaluate lim f(x). The function fis not defined at 
x 


Zs 0 


O (i.e., the number 0 is not in the domain of f). From the graph of f and the 
list of ordered pairs (x, f(x)) shown in Figure 23, it is clear that there are 
values of x arbitrarily close to 0 for which the corresponding values of f(x) 
are arbitrarily large in absolute value (see Table 5). We conclude that 


a. ; 
lim — does not exist. 


z30 X 
TABLE 5 
l 
x f(xy) == 
3 
RS 
Oi F-20 
0.01 | 100 
0.001 | 1000 
0.0001 | 10000 
—1| -1 
—0.1 | —10 
—(0.01 | —100 
—0.001 | —1000 


—Q.0001 | — 10000 


Figure 23 


Thus far our examples have been confined to the problem of finding the 
limit of a function at a number which happens to lie outside the domain 
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of the function. If it happens that the number a is in the domain of f, then 
it is frequently possible to determine lim f(x) at a glance. Consider, for 


t— a 


example, the function f(x) = 2x? — x — 2. As x takes on values closer and 
closer to 3, the corresponding value of 2x? approaches 18, the value of —x 
approaches — 3, and the constant —2 does not change. We conclude that 


Pi ak Deed, 
ae 


or that, for this particular function, lim f(x) = f(3). 
x—3 


EXAMPLE 3. It would be incorrect to suppose that if a is in the domain of f, 
then it always happens that lim f(x) = f(a). Consider the two functions 


f and g defined by 
xr+1 if ier So, 


Ps if x= %, 
x? + 1 ie sv, 
—x’? — 1 if OE 


F(x) -{ 


g(x) = | 


y-axis 


g(x)=x’°+ 1, 
t=O 


(a) (5) 
Figure 24 


Both these functions are defined on the whole real line; i.e., domain f = 
domain g = R (see Figure 24). Furthermore, 


F(0) = 2 and g(0) = 1. 


30 FUNCTIONS, LIMITS, AND DERIVATIVES  [cuHap. 1 


As x approaches 0, however, it is clear that x? + 1 approaches I| and not 2. 
Hence 


lim S(x) = 14 f(O). 


[Note that in computing lim f(x), we consider values of f(x) for all x arbi- 


trarily close to a but not equal to a. This point will be made explicit when we 
give the formal definition.] Turning to g, we see that the value of g(x) near 0 
depends on whether x is positive or negative. For any small positive number x, 
the corresponding number g(x) is close to 1, but if x is smal! in absolute 
value and negative, then g(x) is close to —1. Since there is no reason to 
prefer numbers of one sign to those of the other, we conclude that there is 
no limit. Thus 


lim g(x) does not exist. 
2-70 


The reader may feel that Example 3 loses force because the functions 
used to make the point were in some sense artificial. There is some truth in 
the objection. Recall, however, that one of our major objectives is to reduce 
the class of all functions to those we wish to study in this course. After 
defining lim f(x) precisely, we shall turn our point of view around and use 

w~— a 


this definition as the major tool in the problem of deciding what does con- 
stitute a well-behaved function. 
The conceptual problems in trying to give an exact meaning to the 
expression lim f(x) = 5 revolve around phrases such as “‘arbitrarily close,” 
t—a 


“sufficiently near,” and “‘arbitrarily small.’’ After all, there is no such thing 
in any absolute sense as a small positive real number. The number 0.000001 
is small in most contexts, but in comparison with 0.000000000001 it is huge. 
However, we can assert that one number is smaller than another. Moreover, 
the actual closeness of one number x to another number a is just the distance 
between them: It is |x — a|. One way to say that a function f takes on values 
arbitrarily close to a number 5 is to state that, for any positive real number e, 
there are numbers x such that | f(x) — b| < e. Weare stating that no matter 
what positive number € is selected, 10'’, or 1071’, or 107 ?”’, there are num- 
bers x so that the distance between f(x) and b is smaller than e. Thus the 
difficulty inherent in the phrase ‘“‘arbitrarily close’? has been circumvented 
by the prefix “for any.” To finish the definition, we want to be able to say 
that f(x) is arbitrarily close to b whenever x is sufficiently close, but not equal, 
to a. What does “‘sufficiently close’? mean? The answer is this: If an arbi- 
trary € > 0 is chosen with which to measure the distance between f(x) and 
b, then it must be the case that there is a number 6 > O such that whenever 
x is in the domain of fand within a distance 6 of a, but not equal to a, then 
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the distance between f(x) and b is less than e. The situation is pictured in 
Figure 25. First € > 0 is chosen arbitrarily. There must then exist a number 
5 > 0 such that whenever x lies in the interval (a — 6,a+ 6), andx ¥a 
then the point (x, f(x)) lies in the shaded rectangle. We summarize by giving 


Figure 25 


the definition: Let f be a real-valued function of a real variable. Then the 
limit as x approaches a of f(x) is 6, written 


lim f(x) = 5, 


if, for any € > 0, there exists 6 > 0 such that whenever x is in the domain 
of fand0 < |x — a| < 6, then | f(x) — b| < e. (There is a strong tradition 
for using the Greek letters € and 6 in the definition of limit. In fact, the part 
of calculus that deals with rigorous proofs of the various properties of 
limits is sometimes referred to as “‘epsilonics.’’) 


EXAMPLE 4. The idea behind a formal definition can sometimes be grasped 
most easily by looking at an example where the condition is not satisfied. 
Consider the function g defined in Example 3 whose graph is drawn in 
Figure 24(b). We shall prove that lim g(x) # 1. To do this, we must es- 


20 


tablish the negation of the limit condition: There is an € > O such that, for 
any 6 > O, there is a number x in the domain of g such that 0 < |x| < éand 
g(x) — 1| > €. There are many possible choices for e. To be specific, take 
€ = 4. We must now show that for every positive number 6, there is a non- 
zero number x in the open interval (— 6, 6) such that the distance between 


6 
g(x) and 1 is greater than or equal to 4 (Figure 26). Take x = — 5: This 
number is non-zero, lies in (— 6, 6), and furthermore 


go) = e(-5)= ~F Le 1 


Hence ots) — 1) > 2 > &. 
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ee 


This distance is 
greater than + 


Figure 26 


The basic limit theorem is the following: 


(4.1) THEOREM. I/f lim Fix). = by. and ae g(x) = bo, then 
(1) lim [f(%) + gQ)] = bi + br. 
(ii) lim ef(x) = cbs. 
(it) lim f(x)g(x) = bide. 


(iv) lim oe 2 a provided b, # 0. 
ra 2 


The proofs are given in Appendix A. They are not difficult, and (1) and 
(ii) especially follow directly from the definition of limit and the properties 
of the absolute value. Some ingenuity in algebraic manipulation is required 
for (ii1) and (iv). Note that we have already assumed that this theorem is 
true. For example, the assertion that lim (2x? — x — 2) = 13 1s acorollary 

xr 3 

of (i), (ii), and (iii). 

If a function f is defined for every x in R and if its graph contains no 
breaks, then it is apparent from looking at the graph that lim f(x) = f(a). 


Logically, however, this intuitive point of view is backward. So far, we have 
constructed the graph of a function f by plotting a few isolated points and 
then joining them with a smooth curve. In so doing we are assuming that if 
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x is close to a, then f(x) is close to f(a). That is, we are assuming that 
lim f(x) = f(a). Now that we have given a formal definition of limit, we 


w~—a 


shall reverse ourselves and use it to say precisely what is meant by a function 
whose graph has no breaks. Such a function is called continuous. The 
definitions are as follows: A real-valued function f of a real variable is 
continuous at a if a is in the domain of f and lim f(x) = f(a). The function 


f is simply said to be continuous if it is continuous at every number in its 
domain. 

A continuous function is one whose graph has no breaks, but the 
graph need not be a smooth curve. For example, the function with the saw- 
tooth graph shown in Figure 27 is continuous. The reader should also realize 


y-axis 


X-axis 
Figure 27 


that many functions that are not continuous fail to be so at only a few isolated 
places. Thus the function fin Example 3, whose graph is drawn in Figure 24(a), 
has its only discontinuity at 0. It is continuous everywhere else. Finally, we 
emphasize the fact that there are two conditions in the definition of con- 


3 3 
x° — 1 ae 
= 3. the function 


sede a 
tinuity. Even though lim 1 is not continuous 


ea a 
at x = 1 simply because it is not defined there. 
If two functions f and g are continuous at a, then it is not difficult to 
prove that the sum f + g is also continuous at a. To begin with, a is in the 
domain of f + g since we have (f + g)(a) = f(a) + g(a). Furthermore, we 
know that lim f(x) = f(a) and that lim g(x) = g(a). It follows by Theo- 


rem (4.1)(i) that 
lim [f0) + g@)] = f@) + 8(@). 
Since f(x) + g(x) = (f+ g)(x), we get 
lim (f + g)(x) = Cf, Be), 
which proves the continuity of f + gata. The other parts of the basic limit 


theorem (4.1) imply similar results about the products and quotients of 
continuous functions. We summarize these in 
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(4.2) THEOREM. If two functions f and g are continuous at a, then so are 
ao & 
(ii) cf, for any constant c. 
(i) fg. 
(iv) > provided g(a) ¥ 0. 
A real-valued function f of one real variable is called a polynomial if 


there exist a nonnegative integer m and real numbers dp, a, .. . , a, such that, 
for every real number x, 


F(x) = do tax tees + a,x". 
The following functions are all examples of polynomials: 
f(x) = 2 — 4x + 3x’, 
f(y) = 1 Ty" + By + 7, 
I(x) = x, 
f(x) = 5, 
2(s) = (s? + 2)(s* — 1) = 5’ 4 2s? — 8? — 2. 


It is equally important to be able to recognize that a given function is not a 
polynomial. Examples of functions which are not polynomials are 


f(x) = |x, 

fe) => 

FG) =a ae OT 
f= Ve 


F(y) = ( — 1)”. 


Algebraically the set of all polynomials is much like the set of integers: 
The sum, difference, and product of any two polynomials is again a poly- 
nomial, but, in general, the quotient of two polynomials is not a polynomial. 
Moreover, the algebraic axioms (i) through (v) listed in Section 1 also hold. 

Just as a rational number is one which can be expressed as the ratio 
of two integers, a rational function is one which can be expressed as the ratio 
of two polynomials. Examples are the functions 


fo) = =. 
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gs) = xt = 5 
a a xi + 2x +1 ; 
ex) = =z. 
The domain of every polynomial is the entire set R of real numbers. 
Similarly, the domain of a given rational function om , where p(x) and q(x) 


are polynomials, is the whole set R with the exception of those numbers x 
for which g(x) = 0. Furthermore, we have 


(4.3) Every polynomial is a continuous function, and every rational function 


x 
a, is continuous except at those values of x for which q(x) = 0. 


q(x) 


Proof. The identity function x is clearly continuous, and so is every constant 
function. Since every polynomial can be constructed from the identity 
function x and from constants using only the sums and products of these and 
the resulting functions, it follows from Theorem (4.2) that every polynomial 
is continuous. The assertion about the continuity of rational functions then 
follows from part (iv) of Theorem (4.2). 


It is occasionally useful to modify the definition of lim f(x) to allow x 


to approach a from only one side: 


When this is done, we speak of either the limit from the right or the limit 
from the left and write either 


lim f(x) or lim f(x), 
x—a+ bey ee 


according as the additional condition is x > aor x < a. Thus for the func- 
tion 

x — 1, a 

f(x) = 

x? — 1, eZ, 
whose graph is shown in Figure 28, the limit of f(x) as x approaches 2 does 
not exist. Nevertheless, we obtain 

lm. fi) =", in F(x) ee 3. 
Sat aaa 


2-92-+ 
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Similarly, for the function g in Figure 24(b), we have lim g(x) = 1, 
Y ea 


lim g(x) = —1. 
f-0—* 
y-axis 
(2,3) 
pox, x22 
y=x’-l, ex? (2,1) 
x-axis 
Figure 28 
The graph of the rational function 
Se 
1) ee x 0, 


together with a list of some of the ordered pairs (x, a (x)) that comprise f/f is 
shown in Figure 29. From both Figure 29 and Table 6 it is clear that as x 
increases without bound, f(x) becomes arbitrarily close to 1. We express 
this fact by writing 


-axiS 
ities ae el im 
r— +0 x 
TABLE 6 
x f(x) 
he et 
OO 11 
100 | 1.01 
1,000 | 1.001 
1.000001 


1,000,000 


Figure 29 
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Since f(x) also becomes arbitrarily close to 1 as x decreases without bound, 
1.€., aS —x increases without bound, we write 


ee 
x a 


lim 
The definition is as follows: Let f be a real-valued function of a real variable. 
Then the limit of f(x) is b as x increases without bound, written 
“um ((x) = p, 
L— +00 
if, for any € > 0, there exists 6 > O such that whenever x is in the domain of 
fand 6 < x, then|f(x) — b| < €. Theanalogousdefinition for lim f(x) = b 


is obvious. 


The symbols +-co and —o can also be used to refer to the behavior 
of the values of the function as well as the independent variable. If, as x 
approaches a, the corresponding value f(x) of the function increases without 
bound, we may express the fact by writing 

lim f(x) = +o. 

The reader should be able to attach the correct meanings to the various 

other possibilities: 
lim f(x) = —a, 


r~—a 


lim f(x) = +o, 


Har $(X) = — 06, etc: 
rat 
It is essential to keep in mind that +0 and —« are not numbers. They 
are not elements of R. They are used simply as convenient abbreviations for 
describing the unbounded characteristics of certain functions. The symbol 
+o (or simply «) in an expression for a bound will always mean that the 
quantity referred to increases without limit in the positive direction. Similarly, 


ee Sees aoe age 
— # always indicates the negative direction. Thus we shall not say lim — = o. 


r—0 X 


But we do say 


lim —= oo, 
20+ xX 

] 

lim — = —o, 
z—30— X 
lim j—| = oo, 
zr 0 |X 
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PROBLEMS 


1. Compute the following limits. 


(a) lim~— (f) lim we 1 = 1 
ea 1 r—>0 Xx 
2 
(6). fini ape ee (g) lim |x| 
xr—?2 a Z re 
, 2 2 
i rae ol Obi _ (1—h , 6h —1 
5 | oe (h) im ( h2 +) 
2 , : 
(d) lim (5 is 4) @) sdifig hh) 36 oe) 
zo1 \X — 1 x—1 cu x 
* 2 
(ce) lim x*” 6) rime = eee 


z—0 h=s0 h 


2. Classify the following functions as continuous or discontinuous. If dis- 
continuous, at what numbers a is the function not continuous? 


(a) x +3x-—1 (f) A(x) = = 
a x} 
(b) f(x) = [al (e) f(x) = a “| ool 
x +x+1 , if x is rational 
oe p= ge? oe if x is irrational 
7 = fea ( eo Oy | if x # 1 
ee w) fey = f ifx = 1. 


(ec) Vx +3 


3. A function fis said to have a removable discontinuity at a if it is not continuous 
at a, but can be assigned a value f(a) [or possibly reassigned a new value f(a)] 
such that it becomes continuous there. 


(a) Locate the removable discontinuities in Problem 2. 


(b) Show that the only discontinuities a rational function can have are either 
removable or infinite. That is, if r(x) is a rational function that is not 
continuous at a, show that either a is a removable discontinuity or 
lim |r(x)| = +0. 


~t— a 


4. Using Theorem (4.1), prove that if lim f(x) = 5; and lim g(x) = do, then 


lim [f(x) — g(x)] = 41 — do. 


t— a 
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5. Show that 


10. 
i. 
12. 


13. 
14. 


(a) lim f(x) = bifand only if lim f (*) =, D, 


xr— +00 t—>0+ 


(b) lim f(x) = dif and only if lim f (*) a= Db; 


rI——o t0— 


. Using Problem 5, compute 


1 ie Agr s7eb. 
ue ova. 
= yeti that a ee ee 
—s iil 


. True or false? 


(a) If lim f(x) = 5, then lim f(x) = band lim f(x) = b. 


ra za 2—a— 
(b) If we f(x) = band + tt = 5, then bed Tay BD. 

. Define a function f and draw its graph such that lim /f(x) = 2 and 
- Sf (x) = 0. — 

. Compute 
©) Maa es 
(0) lim Say = 10 @) tim 21. 


Does the set of rational functions satisfy axioms (i) through (vi) in Section 1? 
(Hint: Be careful; note Problem 3.) 
Give the formal definition in terms of inequalities of lim f(x) = b. 


z—a+ 
Define a function f and draw its graph such that lim f(x) ¥ lim f(x), 


although both limits exist. 
Prove that it is impossible to choose a rational function in Problem 12. 
Give the formal definition in terms of inequalities and absolute values of 


(a) lim f(x) = & () tm f(x) = +o. 


rI—a I—>—e 


5. Straight Lines and Their Equations. We shall define a straight line in R? 
to be any subset L consisting of all ordered pairs (x, y) such that 


ax + by+c= 0, where a? + b? > 0. (1) 
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The inequality a? + b? > 0 simply says that the constants a and 6 are not 
both equal to zero. Of course two different equations can define the same 
line. For example, the set of all ordered pairs (x, y) such that4x — 3y +5 = 0 
is the same line as the set of pairs for which 28x = 2ly — 35. For this 
reason, we speak of an equation of a straight line and not the equation. 


(5.1) Suppose that straight lines L, and L, are defined, respectively, by 


Aix oe biy + Cc; = 0, a + by? => 0, 
Axx + boy + co = 0, a,” + by? > 0. 


Then L, = Ly if and only if there is a nonzero constant k such that 


Aap = kay, 
by = kb, 
Co = ke. 


Proof. If such a k exists, then the two equations are equivalent, and so L; = Ly». 
Conversely, suppose that L; = Ly. We may assume without loss of generality 


that b; # 0. Then the point (0 -2) lies on L, since it satisfies the first 
1 


equation; 1.e., 


ay°0 + bo (— 5°) + en = 0. 
by 


Hence 


Ng oe 1 Se 
) lies on LZ; because 


In addition, the point (1 
1 


ay + by oe +c = 0. 


This point then also lies on Lo, and this fact means that 


ay + be a. + co = 0. 
1 


ie: ie, ee 22), 
PIS pas ey oI ee ep 


Since by = (7) b, trivially, we obtain the desired conclusion by setting 


Hence 


. 


k = —. Note that k ¥ 0, for if it were zero, we would get a. = bo = 0, 
u 


contrary to assumption. 
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One consequence of Theorem (5.1) is that it enables us to recognize at a 
glance whether or not different equations define the same straight line. 
Another corollary arises in connection with the following definitions: A line 
L defined by an equation ax + by + c = O witha? + b? > Owill be called 
vertical if b = 0 and horizontal if a = 0. It follows from the theorem that b 
must equal zero for every such equation which defines a vertical line and 
that a must equal zero for every such equation which defines a horizontal 
line. Thus the definitions are not dependent on the particular equation 
which defines L. 

If P = (a,b) and Q = (c,d) are two points in R’? and a #¥c, the 
slope of the line segment joining P to Q is, by definition, 


Note that 


= = m(Q, P). 


a o£ 


me, oy - ¢=* b—d 


The absolute value of m(P, Q) is the ratio of the vertical to horizontal distance 
between P and Q (see Figure 30). It is simply a measure of steepness. A 


Figure 30 


segment with positive slope goes up as it goes to the right; one with negative 
slope goes down as it goes to the right (Figure 31). If a = c, the segment is 
vertical, and the slope is not defined. 


(5.2) Let L be the straight line defined by the equation ax + by +c = 0, 
where b #0. If P and Q are any two distinct points on the line, then 


m(P, Q) = — 


SS 
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Proof. Let P = (x1, y1) and Q = (x2, ye). An equation equivalent to the original 
one is 
a C 
en ee a 


It follows that x, ¥ Xo, since, otherwise, substitution in this equation would 
yield y; = yo, which would then imply P = Q. We obtain 


a Cie 2 ¢ 
ae ee 
m(P, Q) = ———* = ———___— = - = 
bs eee | Xo Xi b 
and this completes the proof. 
positive slope negative slope 


Figure 31 


As a result of Theorems (5.1) and (5.2), we can unambiguously define 
the slope of a nonvertical line L, which we shall denote by mz , as follows: 
For any pair of distinct points P and Q on L, we define 


It follows at once that mz, depends only on the line Z. For if P’ and Q’ are 
any other two distinct points on the line, then 


m(P, Q) = — 5 = m(P’, Q’). 


(Since L is not vertical, b # 0.) Furthermore, any other equation defining L 
can be written kax + kby+kc=0 with k #0, and, of course, 


—-—=- : . We note that the slope of a vertical line is not defined. 


EXAMPLE |. Find an equation of the straight line L through the point (a, b) 
and with slope m. If (x, y) is any other point on the line, then 


PP a 18 
aA 2 
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This is an equation of the line. For suppose L were defined by some equation 
ax + biy + ci = 0. An equivalent equation is 


= (*) x - 1 
, ie by be” 


fy a 
oy (3) 


which implies 


Qa, 


or, since = =, 
by 


; C 
Since we are given that (a, b) lies on L, we get b = ma — = , or 
1 


7 


Ci 
— = ma — b. 


by 


Substitution in (3) yields y = mx — ma + b, which is equivalent to (2). 


Suppose that S is an arbitrary subset of R? with the following three 
properties: 


(i) S contains a point (a, b); 1.e., S is a nonempty set. 


(ii) The slope m(P, Q) is defined and is equal to the same fixed number 
m, for every pair of distinct points P and Qin S. 


(iii) S contains every point (x, y) in R? which is connected to (a, 5) 
by a line segment of slope m. 


These are certainly the geometric properties of a nonvertical straight line. 
It follows from (i) and (ii) that the coordinates of every point (x, y) in S 
satisfy the equation 


y—b= mx — a). (4) 


Conversely, it follows from (iii) that, for every pair of real numbers x and y 
which satisfy (4), the point (x, y) must lie in S. Thus the set S is the graph 
of (4), and, as such, it is a straight line. Since nonvertical straight lines, as 
we have defined them, have the above three properties, we conclude that our 
definition coincides with the natural geometric one. 

We define two lines L; and L; to be parallel if they are both vertical or if 
they have the same slope. 
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The following fact, which we shall prove later using trigonometry, can 
also be deduced from Figure 32 by the methods of plane geometry. 


y-axis 


Figure 32 


(5.3) Two nonvertical lines Ly, and L2 with slopes m, and my», respectively, are 
perpendicular if and only if mm, = —1. 


EXAMPLE 2. (a) Write an equation of the straight line LZ, that passes through 
(—2, 4) and (3,7). (b) Write an equation of the line LZ. passing through 
(5, —2) and parallel to Z;. (c) Write an equation defining the line L; that 
passes through (— 1, —3) and is perpendicular to LZ. 


T— 43 
The slope of the segment joining (—2,4) and (3, 7) is wee 
An arbitrary point (x, y) other than (3, 7) belongs to LZ, if and only if 
hee 7 ce 
Cee ee 


Hence an equation defining L; is 5(y — 7) = 3(x — 3), or, equivalently, 
3x — 5y + 26 = 0. 
The line LZ, also has slope 2. Since it passes through (5, —2), it is defined by 


ge ae 


ae eS; 


or, more generally, by 5(y + 2) = 3(x — 5), which is equivalent to 


ax = Sy = 2 =e, 
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The slope of the perpendicular is —3. Hence we obtain the equation 


ees = 
ae x # —l, 
or 3y + 9 = —5x — 5, as an equation of L;. 


What functions have graphs that are straight lines? The answer is an 
easy one. If fis defined by 


f(x) = ax + 8B, =—f.<— xX <- @, 


then its graph, which is the set of all ordered pairs (x, y) such that y = ax + Bb, 
is certainly a straight line. Conversely, if the graph of an arbitrary function f 
is a straight line, then the equation y = f(x) is equivalent to one of the form 


ax+ by+ ca = 0, Gee Ge > De (5) 


If by were zero, both points (-" ; 0) and (-2 ; ) would satisfy (5), but 
1 


1 
the definition of function makes this impossible for the equation y = f(x). 
We conclude that b; # 0 and that (5) is therefore equivalent to 


It follows [see Theorem (2.4), page 15] that the functions f(x) and 


Re (*) — are equal. Thus the functions whose graphs are straight lines 
1 1 


are precisely those of the form ax + b. These are the polynomials of degree 
less than 2, the linear functions. 


PROBLEMS 
1. For each of the following lines, find an equation that defines it. 
(a) The line through (2, 3) with slope 1. 
(b) The line through (0, 1) with slope 1. 
(c) The line through (0, 1) with slope —2. 
(d) The line through (—1, —3) with slope —3. 
(e) The line through (—2, 1) and (—1, —1). 
(f) The line containing the points (1, 0) and (0, 1). 


(g) The line through the origin containing the point (1, —19). 
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(h) The line with slope 0 that passes through (3, 4). 
(i) The line through (2, 5) and (2, 8). 


. Draw the line defined by each of the following equations, and find the slope. 


(a) x+y=1 (ape 263 
(o) x= —y (e) Ty =3 
(c) 2x — 4y = 3 (f) 4x + 3y = 10. 


. Determine whether P, Q, and R lie ona line. If they do, draw the line and write 
an equation for it. 


(a) P= (0,0), QO = (-1,3), R = @G, —4). 

O)o P= Gi Dou = GrmpieRiaitegosy 

(c) P= (@,a@2), Q = (1,52), R = (€1, €9). 

. Draw the set of all ordered pairs (x, y) such that 

(a) 4x? + day yy? & I2e > by FO ee a = 0 
(b) 5x? + Txy + 2y? + 3x + 3y = (x + 2y + 3x + y) = 0. 


. The x-coordinate of a point where a curve intersects the x-axis is called an 
x-intercept of the curve. The definition of a y-intercept is analogous. 


(a) Find the x- and y-intercept of the line defined by y — 3x = 10. Draw the 
line. 


(b) Write an equation for the line with slope m and y-intercept equal to db. 


. For each of the following equations, define the function f(x) whose graph is the 
set of ordered pairs that satisfy the equation. Which ones are linear functions ? 


(a) .. 34. 7 = 7 (e) y? + 2x + 3 = 0 (two functions) 
(b) Sy = 3 GO) iS a ye 

(c) 2|x| + 3y = 4 (g) y = 3x? + 4x42 

(d) x-—y=1 (h) 5x + 3y = 1. 


. Among the lines defined by the following equations, which pairs are parallel 
and which perpendicular? 


(a) 4x + 2y = 13 (d) y= —2x 
(b) 3x — 6) = 0 ke). 4e,— 13 
(c) 3x +2y = 6 (f).. ay = 13. 


. (a) Write an equation of the straight line L; that contains the points (1, 3) and 
(3, —2). 


(b) Write an equation of the line with x-intercept 1 that is parallel to LZ}. 
(c) Write an equation of the line perpendicular to L, that passes through (1, 3). 


. Prove that the two lines L; and L, in Figure 32 are perpendicular. (Hint: Use 
congruent right triangles or the converse of the Pythagorean Theorem.) 


SEC. 6] THE DERIVATIVE 47 


6. The Derivative. The concept of the line tangent to a curve at a point is an 
important one in geometry. However, it is not so simple an idea as it may 
first appear. Consider the graph of a function f and a point P = (a, f(a)) on 
the graph, as illustrated in Figure 33. Many people who would have little 
difficulty drawing the line tangent to the graph at P would not find it easy to 
give an accurate definition of the tangent line. For example, to say that the 
tangent line at P is the line which cuts the graph at the single point P, although 
true for some curves, is obviously not correct in general (in particular, see 
Figure 33). We shall show that the problem of defining the tangent line to the 
graph of f at P can be expressed in purely analytic terms involving the func- 
tion f. In fact, the problem leads directly to the definition of the derivative 
of a function, the central idea in differential calculus. 


P=(a,f(a)) 


Figure 33 


Let ¢t be an arbitrary nonzero real number, and consider the point 
Q(t) = (a+ 4, f(a + 1), which, together with P = (a, f(a)), lies on the 
graph of f (see Figure 34). The slope of the secant line ZL; containing P and 
Q(t) is equal to 


m(P, Q(t)) = ee (1) 


If ¢ is small in absolute value, then L; is an approximation to what we shall 
define to be the tangent line. The smaller the value of |¢|, the better the 
approximation will be. In some sense, therefore, we would like to define the 
tangent line L to be the limit, as t approaches zero, of the lines L;. We can 
do this, for although we have not defined a limit of lines, we have defined 
limits for functions, and hence we can express the limit of the slope of L;. 
According to equation (1), it is given by 

fat)-f@. i 


lim m(P, Q(t) = lim —————— 
t—+0 t—0 t 
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We shall define the tangent line to the graph of fat P to be the line through P 
having this limit as its slope, provided the limit exists. 

Leaving the geometric interpretation aside for the moment, we observe 
that the value of the limit in (2) depends only on the function f and on the 
number a. Hence we give the following definitions: An arbitrary real-valued 


y-axis 


O(t)=(at+t, f(a+t)) 


a P=(a,f(a)) 


X-axis 


Figure 34 


function f of a real variable is differentiable at a number a in its domain if 


pe 


t—0 


exists (i.e., is finite). The derivative of f at a, denoted f’(a), is this limit. Thus 


ACE Sem ie 
i 


J (ay scm 


t— 0 


If f is differentiable at every number in its domain, it is simply called a 
differentiable function. 

Thus the slope of the line tangent to the graph of f at the point (a, f(a)) 
is equal to the derivative f’(a). It follows that an arbitrary point (x, y) lies 
on this line if and only if 


y—f@=f/'@ — a), 


and we therefore obtain the following equation of the tangent line: 


y=f{@t+f@e — a). 
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Note that the only variables that appear in this equation are x and y, and 
these occur with exponent 1. The equation therefore defines y as a linear 
function of x. 


EXAMPLE |. Find the derivative of the function f(x) = x? + 2 at x = 2, 
and write an equation of the line tangent to the graph of f at the point (2, 6). 
As we have seen above, the slope of the tangent line is the derivative f’(2), 
and 
f’Q) = lim pi J). 
t—0 t 

We have f(2)=6, and f2+H)=Q4+7)4+2=f74+ 44+ 6. 

Hence 
——— = 
So : 
f'(2) = lim (t + 4) = 4. 
t—0 


2 
er ei yslae< if = 0. 


The tangent line passes through (2, 6) and has slope 4. Hence (x, y) lies on the 
tangent if 


es. 4, wae 2. 
x —2 


and we therefore obtain 
y—6= 4& — 2) or 4x -—y—2=0, 


as an equation of the line. 
EXAMPLE 2. Consider the function g defined by 


ft 
Be a 5” 2 2. 


Compute the derivative g’(3). By definition, 
ot )) = 8G). 


6 ae 
t—0 t 
1 
We have g(3) = 2, and g3+ 24 = aes 
ee) 
eo 51(t + 5) 
ae Soe 
= St 5) 
= ift ~0 


50 FUNCTIONS, LIMITS, AND DERIVATIVES  [cuap. 1 
We conclude that 
’3) = lim{ — ee ee — is 
ae S(t + 5)) 20 


EXAMPLE 3. Find F’(a), where a > O and F is the function 


I 
F(x) = — 
(x) V/ 


) Ue ys = oe, 
5 


and write an equation of the line tangent to the graph of F at the point 
(4, 3). By the definition of the derivative, 
Fig) = ti GAOT EO. 
t—0 t 
In this case, 


F(a +) — F@ = 3 ieee Ms was 

f t\Vatt va 
The problem in computing any derivative from the definition is always the 

= “Pte te = i) = 
same. We set up the fraction ee and then compute the limit. 
To begin with, we are faced with a fraction both the numerator and de- 
nominator of which approach zero. The limit we seek is the relative rate at 
which numerator and denominator go to zero. With most examples it is not 
possible to tell from a cursory glance just what that relative rate is. So we 
experiment, performing various algebraic manipulations that hopefully 
will finally change the fraction into a form from which we can tell what the 
limit is. In the present example the following manipulation will do the trick: 


r( 1 ~ J.) - | ve~ wat i vet vert 
i\Watkt A/a t opfawea it (A/a +e 
a ok eG + ft) 
tVaVatt(Vat+ Vat t) 
as a dee Pe: Hina if ts 


Vai + at(Vat+ Vat 2b) 
It is now possible to see what happens as t — 0. 


F(a) = im —— = = Se 


o/b aie a =) tae 
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Our principal interpretation of the derivative F’(a) is that it is the slope of the 
line tangent to the graph of F at the point (a, F (a)). For this particular 
function F, an equation of the tangent line at (4, $) is therefore found by 
writing 

l 


Sr 
SS i = oS 


ae 
x —4 16 


Hence an equation of the tangent is 


The notation f’(a) for the derivative suggests that we regard f’ as a new 
function whose value at a is the number f’(a). The domain of f’ is the set of 


all real numbers a for which jim EB RO 
t—0 


view, it is natural to think of the derivative evaluated not only at an arbitrary, 
but fixed, number a but also at a variable x. In so doing, we are admitting 
the same dual interpretations that were discussed in Section 3. That is, we 
can interpret f’(x) either as the value of the function f’ at the number x, 
whence 


exists. With this point of 


f(s) = tim LHD = SO), 


t+ 0 


or as the composition of the variable x with the function /’. 


EXAMPLE 4. If f(x) = x* — 1, plot the graph of the derived function /’. 
For any real number x, 


f'(x) = lim 


Iie st) — fy) 
{ 


We have 
fx+)-f@=(@+9-1I)-@-) 
= 3x°t + 3x + 2, 
and so 
fs Se) z = 3x +304, ifis 0. 
Consequently, 


f'(x) = lim Gx? + 3xt + 1°) = 3x’. 
t— 0 
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The graph of the function f’(x) = 3x? is the parabola shown in Figure 35, 
on which the graph of the original function f(x) = x* — 1 has also been 
drawn. 


y-axis 


f(x) = 3x? 


ny X-axis 


fey=x* -1 


Figure 35 


It 1s not surprising that there are several common notations for the 
derivative of a function. One strong tradition reflects the basic fact that the 
derivative is the limit of a ratio by writing it as a ratio. Thus 


dy 
dx ~ J 


This way of writing the derivative is called the differential notation. Using it, 
we denote the derivative of f at a by 


F(a) = f(a. 


EXAMPLE 5. Let f(x) = x* — 1. It was shown in Example 4 that f’(x) = 3x”. 
Fach of the following equations is an example of acceptable notation. 


| een a 
yO) = 322 ae 12, 
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Le aay age 


d 
One could also write a (x) = f'(x) = 3x’. There is no need for it, how- 


ever, since f’(x) becomes identified with f’ when it is regarded as the com- 
position of the independent variable x with the function /’. 


4g : 
It should be emphasized that although the notation a suggests a ratio, 
x 
the derivative as we have defined it is not a ratio—even though it is the limit 
a ae 
of one. a is simply an abbreviation of /’. 
x 


There are a few variations on the two notations that we have given for 
the derivative which we shall also use frequently. If y = f(x), we may write 
any one of 


for the derivative. Similarly, for the derivative at a real number a, we have 
d ) d 
y'(@) = F(a) = fa) = FO. 


Still other notations for the derivative, which we shall seldom use, but which 
the reader may encounter in other books are 


Df = D,f = Dy = Dry = y, 


where it is assumed that y = f(x). 


EXAMPLE 6. It follows from the computation in Example 3 that if 


F(x) = x_?, x > 0, then the derivative is given by F’(x) = —4x 2”, 
If we write y = x_"/*, x > O, the derivative is also written 
ee 
es em Sir 
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The slope of a straight line is the ratio of a change in y to a change in x. 
It therefore measures the rate of change of y per unit change in x for the 
ordered pairs (x, y) that make up the line. Consider the two lines defined by 
y = 10x — 3 and y = x — 3 respectively. The rate of change of y to x is 
10 for the first and 1 for the second. For a function whose graph is not a 
straight line, however, the concept of the rate of change of y, or f(x), with 
respect to x 1s more profound. There is the problem that the change in func- 
tional values f(x) per unit change in x will not be constant along the graph. 
More basic, however, is the question of the precise meaning or definition 
of the rate of change. The answer is provided by the derivative. Since 
f’(@ is the slope of the line tangent to the graph of f at the point (a, f (a)), it 
measures the rate of change of f(x) with respect to x at that point. In 


y-axis 
yu x 42 


2 x-axis 


Figure 36 


Example 1 we showed that if f(x) = x? + 2, then f’(2) = 4. We interpret 
the number 4 not only as the slope of the line tangent to the graph of f at 
(2, 6) but also as the rate of change of f(x) with respect to x there. From the 
picture of the graph in Figure 36 it is apparent that at (2, 6) a small change in 
x produces a corresponding change four times as great in f(x). 

In Section 4 the idea of limit was introduced by examples and by ex- 
ploiting the reader’s intuitive understanding of continuity and continuous 
curves. We then gave a formal definition and proceeded in terms of it to go 
back and define continuity precisely. We shall do an analogous thing here 
and now define the slope of the graph of f at the point (a, x (a)), or more 
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simply the slope of the curve y = f(x) at (a,f(a)), to be the derivative 
f'(@). 


We conclude this section with the theorem 


(6.1) If a function f is differentiable at a, then it is continuous there. 


Proof. The hypothesis that 
lim oo 0) 


t—0 t 


exists implies tacitly that a is in the domain of f. If a quotient approaches a 
finite limit as the denominator approaches zero, then the numerator must 
also approach zero. This fact is a consequence of the theorem that the limit 
of a product is the product of the limits [see part (ili) of Theorem (4.1), 
page 32]. In this case, we have 


lim [f(a + t) — f(a)] = lim eed , ‘ 
i220 t—0 


tae tei -lim t 
t—0 t t—>0 
= [4a Oo = 0, 
The equation lim [f(a + 1) — f(a)] = 0 is equivalent to 
t—0 
lim fa+t = f@. (33 


If we set x = a + ¢, then x approaches a as t approaches 0, and conversely. 
So (3) becomes 


lim f(x) = f(a), 


t—a 


and the proof is complete. 


PROBLEMS 
1. Let f(x) = 3x? + 4. Using the definition of the derivative, compute 
fay f°) (b) f’(a), for an arbitrary real number a. 


2. Write an equation of the line tangent to the graph of the function fin Problem 1 
at the point 


(a) (1,7) (b) (a, f(a). 


yt F(X) = - Z re compute F’(3) using the definition of the derivative. 
x 
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4. Using the definition of the derivative, compute f’(a) for each of the following 


functions. 
(a) f(x) = x" Gy “f6y Sa ee 
(b) f(x) = x°+3x45 (:) faavVve+1 
= 1 
ie. faye 9 i) sfaS Tra 
(d) f(x) =Vx, a>0 iy Fe = x". 
1 
ey? Big x #0 


5. Using the results of Problem 4, find an equation of the line tangent to the 
graph of fat the point (a, f(a)), where 


(a) 
(b) 
(c) 
(d) 
6. (a) 
(b) 
7 AB) 


(b) 


Ji) = 2 Meda = 0. 
f(x) =  +.3x% + Sanda = 4. 
f(x) = 7 and a is arbitrary. 


1 
f(x) =xt+ = and a is not zero. 
x 


If F(x) = x”, use the definition of the derivative to find F’(x). 
Plot the graphs of F and F’ on the same xy-plane. 


Show that the function |x! is not differentiable at 0 and interpret this fact 
geometrically. 


Compute the derivative at —1 and at 1 of the function |x|. 


8. Show that the function \/x is not differentiable at 0. Draw the graph and 
interpret the nondifferentiability geometrically. 
9. Using the results of Problem 4, find 


(a) 


(b) 


(c) 


(e) 


df —— 
oe (—1) if fx%) = x + 3x+5. 


i eae oe 
r (3).10. fi) Hix 


af 
dx 
dy/x2 +1 
dx 
d(x” + 3x + 5) 
dx 


(b) if f(x) = x +3x° 4+ 3x 41. 


(a). 
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(f 4(«+4). 


10. (a) If y =2x + 4, ond = (a). 


(b) If s = 169’, find S (2). 


») ds 
(c) Ifs-= 467, find = 


11. Using the definition of the derivative, prove that if y = ax? + bx +c, then 
bed = 2ax + Bb. 
dx 


12. Give an example of a continuous function that fails to have a derivative at 
some point. 


7. Derivatives of Polynomials and Rational Functions. Computing /’(x) 


t — 
from the definition of the derivative by evaluating jim FOF can 
t—-0 


be quite a job. In this section we shall develop a set of theorems from which 
the derivatives of many functions, including all polynomials and rational 
functions, can be found easily and, what is more important, in a completely 
routine way. 


(7.1) If f and g are differentiable functions, then their sum f + g is differentiable. 
Moreover, (f + g)’ = f’ + g’. 


Proof. Let a be a number in the domain of f + g. Recall that by the definition of the 
sum of two functions 


(f+ 2)@ = f@ + e@), 
(f+aa@t+to=fatot eat dv. 
Hence, by the definition of the derivative, 


of kay gyn OG tO Vege 


t—0 t 

in LEO + se + 0 - YO F s@) 
t— 0 t 

ae (= =. 2 lac ie). 
t—0 
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It follows from the existence of f’(a) and g’(a) and the fact that the limit of 
a sum is the sum of the limits [see the basic limit theorem (4.1)(i)] that we 
may continue the above sequence of equalities, writing 

BEE Ea: 2 = ie rae gla + : — g(a) 


t—+0 t— 0 


= f'(a) + g(a) 
= (f’ + 2’)@. 


This completes the proof. 


(7.2) If f is a differentiable function and c is a constant, then cf is differentiable 
and Gs y = cf’. 


Proof. For any number a in the domain of f, we have (cf)(a) = cf(a). Hence 


(cf)'(a) = lim ele FOIE) 
= lim cfa + 9) — cf@ 
t—0 t 


= tim (¢ Ae 9— FO), 


£50 t 


By the basic limit theorem (4.1)(ii) and the assumed existence of f’(a), we 
can continue the chain of equalities, writing 
tie i= Fa) 


= clim— 
t—0 t 


cf'(a) = (cf)@). 


This completes the proof. 


By taking c = —1, we get as a corollary of (7.1) and (7.2) that 
Cf eh = oe 2 
(7.3) The derivative of any constant function is the constant function zero; 1.e., 


Cad, 


Proof. Recall that we allow ourselves the liberty of denoting a real number and the 
constant function whose value is that real number by the same letter. Doing 
so here, we have 


Clay = cla + fy =<, 
for any numbers a and t. Hence 


c'(a) = lim ca + t) — c@) _ lim 2 ee: 
t—+0 t rae eS 3 
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EXAMPLE 1. Let f(x) = x*, and g(x) = /VWx+1(x> —1), and A(x) = 
x? + 3, and suppose we are given the information that 


f'@y = 3x’, 
l 
(x) = ———_? XS —l, 
a 2/x+ 1 
ity = 2x. 


It follows from the three theorems developed so far in this section that the 
derivatives of the functions 


(ay 5x. 2s/x.-+ 1, 
os. 
(6y3a> + 139x747, 


are, respectively, 


(a’) 15x° — ot 
Vx+1 
(b’) 2x, 
(ce) 9x". + 26x, 
For example, to get (b’), we write x? in the form (x? + 3) — 3. Then 
y= CO + 3) — F = 2x — 0 = 2x. 


The others are equally routine. 


The next theorem deals with the derivative of the product of two func- 
tions and its conclusion is perhaps unexpected. Note that it does not turn 
out that the derivative of a product is the product of the derivatives. 


(7.4) If f and g are differentiable functions, then their product fg is differentiable. 
Moreover, (fg)’ = f'g + g'f. 


Proof. Let a be a number in the domain of fg. By the definition of the product of 
two functions we have 
(fg)(a@) = f(ag(a), 
(fg\a+ ) = f(at+ dg + 2). 
Hence 
— im LE + D8G@ + DY — f@s@) | 


t=0 t 
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The following algebraic manipulation will enable us to put the above fraction 
into a form in which we can see what the limit is: 


flat+ Ngat+ 1) — f@g@ = flat dgsat) —f@gsatd 
+ f@e@ +.) — f@e) 
= wae elisa + 
+ ieee ) ~ 2@i@. 
Thus 
e+) te ee fa)| 


t 


cy ae ee ee 
(fg) (a) = a7 a eS 


The limit of a sum of products is the sum of the products of the limits. 
[Again, see the limit theorem (4.1).] Moreover, f’(a) and g’(a) exist by 
hypothesis. Finally, since g is differentiable at a, it is continuous there [see 
Theorem (6.1)]; and so lim g(a + ft) = g(a). We conclude that 

t—0 


ae im (atone lim g(a + 0 


350 


< him sa so] ae 


= (ore sere = e+ oye 


This completes the proof of the product rule for differentiation. 


EXAMPLE 2. Suppose we are given the information that the functions f(x) = 
(x? + 2) and g(x) = (x? + 2)’ have derivatives 


f'(x) = 6x(x? + 2), 
- ox) Se 2. 


Find the derivative of f(x)g(x) = (x? + 2). Theorem (7.4), which is some- 
times called Leibnitz’s Rule, states that 


(f(x)g(x))’ = f(g) + 8’) f (>). 


Hence 


((x? + 2)%)! 


6x(x? + 2)?(x? + 2)? + 10x(x? + 2)4*(x? + 2)? 
16x(x? + 2)’. 


The graph of the identity function x is the straight line defined by the 
equation y = x, which passes through the origin and has constant slope 1. 
It follows that the derivative of the identity function is the constant function 1. 
Thus 

x = 1, (1) 
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We can apply the product (Leibnitz’s) rule and obtain 
(ae tery = vx + xX = LX 4 lx = 2x. 


Since x* = xx”, and we have just found the derivative of each factor, we can 
use the product rule again to get 


(ee A eee AND TH 26x 
= 3", 
Again, 
eS eS ex = Ix? + 3x? x 
a 


These results suggest not only the statement of the next theorem, but 
also how to prove it. 


(7.5) If xis the identity function and nis a positive integer, then (x")' = nx". 


Proof. We have already proved the theorem for n = 1. (Actually we have also 
proved it for n = 2, 3, and 4, but for the moment this is irrelevant.) Suppose 
we had proved it for all positive integers up to and including k. In par- 
ticular, we would know that (x*)’ = kx*—1!. We could then use the product 
rule to derive (x**!)’ = (xx*)’ = x'x* + (x*)'’x = Ix* + kx*1-x = 
(k + 1)x*. Thus the theorem is true for n = 1, and if it is true for n = k, 
it is then also true forn = k + 1. We conclude that the theorem holds for 
every positive integer n. This is an example of a proof by mathematical 
induction. The reasoning can be paraphrased like this: Suppose I know that 
I can get on the bottom rung of a ladder. Suppose further that, if I am stand- 
ing on any rung, then I can reach the next rung. It follows that I can climb 
the ladder. 


EXAMPLE 3. Find the derivatives of the polynomials: 


loa ea = 2, 
g(x) = 3x°+ 7x — 13, 
y = 4x* + 3x? + 2x’ + x, 
s = 3gf (g is a constant, and ¢ 
is an independent variable). 


The answers are immediate: 
f'(x) = 3x’, 
g(x) = Oxi 7; 
y = 16x? + 9x? + 4x 4+ 1, 


/ 


s’ = gt. 
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It should be clear that, as a result of the rules developed so far, the deriv- 
ative of any polynomial function can be computed immediately and in a 
purely mechanical way. We turn next to the derivative of a ratio. 


(7.6) If f and g are differentiable functions, then the quotient is differentiable 
[if g(a) = O, then (4) (a) is not defined]. Moreover, 
& 


(2) = 


ee 


g? 
Proof. We first prove that the function — is differentiable at a number a in its domain 
& 


provided g(a) # 0. By definition, 


S (1) (a +1) - (*) (a) 
(:) (a) = lim eae ae cd 


t—0 t 
See 
ype SE TD: 
t— 0 t 


Note that since g is continuous at a [see Theorem (6.1)] and g(a) ¥ 0, we 
know that g(a + ft) ¥ 0 for sufficiently small values of ft. Since 


1 1 
ga+t) g(a) _ g(a) — sath 
t tg(a)g(a + 1) 


ie —— & $A - #0), 
g(a)g(a + 1) t 


() © fee 
3 t—+0 g(a)g(a + f£) t 


The derivative g’(a) exists by hypothesis, and lim g(a + #) = g(a) ¥ 0. 
t—0 


we have 


The basic limit theorem (4.1) therefore implies that 


Don ee tS Se 
() 2 (=z lim g(a + 5) (iim i ) 


gs ae a 
(g(a)? (g(a) 


This proves the differentiability of the function — and also establishes the 
following special case of the quotient rule: 


(2 ees = 
(*) e ‘= 
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The general form of (7.6) can now be obtained using the product rule: 
1 1\/ 

pbs p (2 
& & 


— — = er ey hee 
3 g? g? 


OQ ISK 
Bia, et 
I 
Pee” 
Sy 
ee 
| 


This completes the proof. 


EXAMPLE 4. Find the derivatives of the following rational functions: 


2 
x +1 
ee 
: eS ae ae 
sy) = esl : 
l 
==,” 
u—a 
age (a and b are constants and u 


is an independent variable). 


Applying our six rules, we get 


x? x? 
a G*® — Dy — 3) — GO" — 3y + 13 
ge ew 
2 ee oo Eee 
(y= tPF 
— 35” 3 


et | 
See (u — by ~ Gu — by 


It is important to realize that the symmetry present in the product rule is 
missing in the quotient rule. For the former, order is immaterial: The prime 
appears once on one factor and once on the other, and that is all there is to 
remember. This is not so for the quotient rule, however, where the wrong 
order will result in the wrong sign in the answer. There is no help for it but 
to memorize the formula precisely. 
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The formula for the derivative of x” has been proved only if 7 is a non- 
negative integer. (It holds for n = O because x° = 1.) The next theorem 
enlarges the scope of the formula to include all integers. 


(7.7) If x is the identity function and n is an integer (positive, negative, or zero), 
en (ey == ax. 
Proof. We shall assume that n is a negative integer, since the theorem is known to be 


eh a 1 
true otherwise. Then m = —n is a positive integer, and x” = 23 Using 
(2) and (7.5), we get - 


/ ™my?1 m1 
ey = (4) --G- = 


x™ 
= (=fr ~~ = as 
This completes the proof. 

Thus, for example, if f(x) = x~’, then f’(x) = —7x~*. In Section 8 
we shall show that the formula is actually valid, not only for integers, but for 
any rational number nv. Finally, in Chapter 5 we shall prove that (x%)’ = 
ax*—', for any real number a. 

Let us summarize in a single list the theorems that we have developed 
for finding derivatives. To provide practice, we shall this time employ the 


es : 
alternative rs notation. Let u and v be differentiable functions of x, and ca 
x 


constant. Then 


Note that we have proved these theorems for arbitrary differentiable func- 
tions uw and v, not just for polynomials and rational functions. 
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EXAMPLE 5. Let 


y=2x° + Ix+1 
ae 
xd 
3f + 2t+ 1 
Ss = 
t— 4 
Then 
dy 2 
Fe + 7, 
du 6 5 
ge 
Os. oe ta + 2) = Gr meet ae Wed on = a 
dt — (t — 4)? a 4)? 


We have seen in this section that the derivative of a polynomial is another 
polynomial, and the derivative of a rational function is a new rational 
function. Once we have found the derivative f’ of any function f, we can go 
on and find the derivative of f’. The new function, denoted f’’, is called the 
second derivative of f. Clearly, 


f(a) = tim FEF OR LO, 


The third derivative, written f’’’, is the derivative of the second derivative, 
and, in principle, we can go on forever and form derivatives of as high order 
as we like. It would obviously be absurd to write the seventeenth derivative 
with seventeen primes, so we adopt the alternative notation f ™ for the nth 
derivative of /. 

The differential notation for the higher derivatives is based on the idea 


q, ; ; 
that es is a function, sometimes called an operator, which assigns to a func- 
x 


tion its derivative with respect to x. Hence we write 


df se 
dx d () re 


= eo 
dx ¢ (2) - ee Po 


af _ pm 
as : 


TL (q) = f(a), ete 
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In addition, if a variable is used to denote a function, for example, if y = f(x), 
we also use the expressions 


EXAMPLE 6. Let f(x) = x3 + 3x2+ 1. Then 
t'(x) = 3° + Gx, 
F''(x) = 6x + 6, 
IO) # 6, 
fae ee > 3. 


As another example, let y = ———. Then 

xe 

Woe «to saber 

dx : (x+ 1) 

oe ew 

dx? (x + 1)3 

qd? os 

dxn ~ (x + Iyer a 
PROBLEMS 


1. With the aid of the rules for differentiation given in this section, compute 


df 


f= = for each of the following functions. 
3 


a) fb = Sf Sa ) fe) =5— 

(b) f(x) = *@ +1) (g) f(x) = o> 
3 

(c) f(x) = x°@ + 2 (h) f(x) = os 


2 
(d) f(x) = @° — 40° + 2x4+3) @ f= € ~ ‘) 


@) fx) = 2° +5, G) fo) = @ +1) 
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2x + 1 2 
(K): SO) = ames Gay fly Gt x) 
d f%)=@° +1)” (n) f(x) = (& — a(x — b)(x — o). 


2. Determine an equation of the line tangent to the parabola y = x? — 4x + 5 
at the point (1, 2). Draw the parabola and the tangent line. 

3. The parabola y = ax? + bx + c passes through (0, 4) and is tangent to the 
line 2x + y = 2 at the point (1,0). Find the coefficients a, 6, and c for the 
parabola. 

4. Show that if f, g and / are differentiable functions, then 


ey = (eh +h ane. 


5. What is the correct product rule for differentiation, analogous to the one in 
Problem 4, for (a) four factors, (b) n factors? 
6. Obtain an equation of the tangent line to the graph of the function 


ia 


x? +] 


at the point where x = 2. 
te Z , 
7. Cayoae Fiz) — 2s + 2 + —> then f'(2) Sapte ete 


(b) TE fie) = 4.2 + 4 then /"(x) elmer 


(c) a a 
x—1 dx 


(qd) ify = » den” (2) ae 
x dx 


x +1 df 
(e) lf f(x) = —~,—’ then — (a) = 
2 dw 
(f) Ifw = 3u' + 4u + 2, then = 


8. The parabola y = ax? + bx + ¢ is tangent to the tine y = 4x +7 at the 
point (—1, 3). In addition, a (—2) = 0. Find the coefficients a, b, and c. 
x 


9. For each of the following functions f, compute the derivative f’ and the second 


derivative f’’. 
(a) f(x) = 3x7 + 2x41 (dd) f(t) = *(? — 1) 
a oe ©) saath 


2 


4 3 9 
© fM=5+E+Ft¢xt1 fe) = 


68 


10. 


ft. 


12. 


rs 


14. 
15, 
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The line y = 3x — 1 is tangent to the graph of the function f(x) = ax*® + 
Ye 

bx* + cat the point (1, 2). Furthermore, = (1) = 0. ‘Compute a, b, and c. 
x 


2 


3 2 ay 
oe =e ae ge oe 
(a) If fx)=x —-x +x-I1,t en 
x—I1 d’y 
(b) Ify = ese a then 7a = 
(c) If s= at’ + bf + ct + d, where a, b, c, and d are constants, compute 
d's 
dt? 
1 d° 
(d) If y ome me then = (a) ee 


3 
Find all points on the graph of the function . — x? at which the tangent line 


is perpendicular to the tangent line at (1, —4). 

There are many examples of a function f and a number a such that f(a) is 
defined (a is in the domain of f) but f’(a) does not exist. Another way of saying 
the same thing is that the domain of f’ can be a proper subset of the domain of 
f. It is equally possible for f’(a) to be defined and f’’(a) not to be. Let f be the 


function defined by 
ye 


= if ¢> 0, 
f(x) = 2 

a ifx <0. 
(a) Compute /’. 


(b) Is fa differentiable function? [That is, does f’(a) exist for every real 
number a?] 


(c) Show that f’’(0) does not exist, and compute f’’(x) for x ¥ 0. 


Same as Problem 13 except that f(x) = x*4/°. 
(a) Draw the graph of the function g defined by 


x, eB 
g(x) = 
2x — l, : ae zs 
(b) Compute g’ and g’”’. 


(c) Are g and g’ differentiable functions? 


8. The Chain Rule. The theorems in Section 7 were concerned with finding 
the derivatives of functions that were constructed from other functions using 
the algebraic operations of addition, multiplication by a constant, multiplica- 
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tion, and division. In this section we shall derive a similar formula, called the 
Chain Rule, for the derivative of the composition f(g) of a differentiable 
function g with a differentiable function f. Before giving the theorem, we 
remark that an alternative way of writing the definition of the derivative of a 
function f is 


fla) = tim LOPS LO). a 


The substitution x = a + ¢ will transform (1) into the expression that we 
have heretofore used for the derivative. An equation equivalent to (1) is 


eee f(a) - s@| = 0. 


If we set 


r(x) = LOL) _ pra, oF 


then lim r(x) = 0. The quantity r(x) is the difference between the slope of 


ta 


the line segment joining (a, f(a)) to (x,f(x)) and the slope of the line 
tangent to the curve y = f(x) at (a,f(a)). Two equations equivalent to (1) 
and (2), and which together simply constitute another way of saying that 
f(a) is the derivative of f at a, are 


7) — f(a) = [f'(@).+ roi — a), 
oy = 6. ce 


wa 


We shall use these in proving the following: 


(8.1) THE CHAIN RULE. If f and g are differentiable functions, then so is the 
composite function f(g). Moreover, [f(g)! = f’(g)g’. 


Proof. Let a be a number in the domain of g such that g(a) is in the domain of f. 
By definition 


[f(g)l(@) = lim LENO) — S@)I@ 


=> 2 tie 

fam £6) — f(e@)) 
ts ee eet 

I xX —a@ 


The intuitive idea behind the Chain Rule can be seen by writing 
. ae =i ffgle) 20) — _ 
im | == 
ae gi) = sla) oe 
c fie) = 7 ie) | ee = ve 
= | lim ———JH—————_ | | lim —-———__ 
ca g(x) =r g(a) tO Boo 


[f(g)I'(a) 


70 
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Setting y = g(x)andb = g(a) and noting that y approaches b as x approaches 
a, we have 


[f(g)I'(@) = lim Sa Bee. lim g(x) — g(a) 


y—b (lee C0 x — a 
= f'(b)g’(a) 
= f'(g(a))g'(a) = (f"(g)g')(a), 


which is the desired result. This argument fails to be a rigorous proof 
primarily because there is no reason to suppose that g(x) — g(a) ¥ 0 for all 
x sufficiently close to a. To overcome this difficulty, we use equations (3). 
These equations, with a typical element in the domain of f denoted by y 
instead of x and with the derivative evaluated at b, become 


fo) - f®@) = FO + rOv@ — 4), 
lim rly) = 


y—b 
Substituting y = g(x) and b = g(a), we get 


K(gx)) — f(e@) = [/'(e@) + r(g@)] [ged — g@). 
Hence 


SE) = FCO) _ (ea) + r(¢0)] ss) - = #@), 


fae” a 


We know that lim moe 


ra ce 


at a, it is continuous there Tse Theorem (6.1)], and so ms g(x) = e(@ =o 


= g’(a). In addition, since g is differentiable 


Since lim r(y) = 0, it follows that |r(y)| can be dehé Pagbitradle small by 


yb 
taking y sufficiently close to 6. Because lim g(x) = 5, we may therefore 


conclude that lim r(g(x)) = 0. The basic limit theorem (4.1) asserts that the 


limit of a sum or product is the sum or product, respectively, of the limits. 
Hence 


Lf(@)N'(a) = lim a a 


ct =e 


= | lim f'(g@) + =e 1 r(etx)) | lim -— a 


ta 


Zim ——_ 
a 


= [f’(g(a)) + Olg’(a) = f'(g(a))g'(a) 
= (f(g)g’ (a), 


and the proof of the Chain Rule is complete. 
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EXAMPLE |. If F(x) = (x? + 2)’, compute F’(x). One way to do this problem 
is to expand (x* + 2)? and use the differentiation formulas developed in 
Section 7. 


F(x) = (x? + 2)? = x®° + 6x* + 12x? + 8, 
F'(x) = 6x°® + 24x? + 24x. 


Another method uses the Chain Rule. Let g and f be the functions defined, 
respectively, by g(x) = x’ + 2 and f(y) = y*. Then 


f(g(x)) = G? + 2)? = FOX), 
and, according to the Chain Rule, 
F'(x) = [f(e(x))V¥ = f’(g(x))g’(). 
Since g’(x) = 2x and f’(y) = 3y?, we get f’(g(x)) = 3(x? + 2)? and 


PC) = 3(x* + 2)°(2x) 
= 6x(x* + 4x? + 4), 


which agrees with the alternative solution above. 


EXAMPLE 2. Find the derivative of the function (3x’ + 2x)!”°. In principle, 
we could expand by the binomial theorem, but with the Chain Rule at our 
disposal that would be absurd. Let g(x) = 3x’ + 2x and f(y) = y'*®. Then 
g’(x) = 21x° + 2 and f’(y) = 128y'””. Setting y = 3x’ + 2x, we get 


((3x" + 2x)'*)’ = [f(e(x))’ = f’(e@)e’@) 
= 128(3x? + 2x)27(21x* + 2), 


The above two examples are instances of the following corollary of the 
Chain Rule: If fis a differentiable function, then 


ag ES ae for any integer n. 


To eee et, Fy) —.9". -..Then .F(f/)= f" and .we know that 


F'(y) = ny”". Consequently, (f")’ = [FY = F°C*S’ = nf" f'. A 
significant generalization of this result is 


(8.2) If f is a positive differentiable function and r is any rational number, then 
Cy o: er ie 

The requirement that fis positive assures that f” is defined. A nonpositive 
number cannot be raised to an arbitrary rational power. However, as we 
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shall show on page 267, the requirement that r be a rational number is un- 
necessary. Theorem (8.2) is actually true for any real number r. 


Proof. Let r = —, where m and n are integers, and set h = f’ = f/", Then 


= [3 


kh = (frm = f™, which implies that Gy = ("). Using the aie 
formula for the derivative of an integral power of a function, we get 


ne = ay 


Solving for A’, we obtain 
P Ti 4 —f uel 5 m r1—n -m— 
if = Of ee 


a ry ER 
a FS apes 2 


This completes the proof—almost. Note that we have in the argument 
tacitly assumed that /, the function whose derivative we are seeking, is differ- 
entiable. Is it? If it is, how do we know it? The answer to the first question 
is yes, but the answer to the second is not so easy. The problem can be 
reduced to a simpler one: Jf is a positive integer and g is the function defined 
by g(x) = x'/”, for x > 0, then g is differentiable. If we know this fact, we are 
out of the difficulty because the Chain Rule tells us that the composition of 
two differentiable functions is differentiable. Hence g(/) is differentiable, 
and g(f) = f'/".. From this it follows that (f!/")” is differentiable, and 


; mn 
(fi/rym = fm/n, (When we express r as a ratio — , we can certainly take to 
n 


be positive.) A proof that x!/” is differentiable, if x > 0, is most easily given 
as an application of the Inverse Function Theorem (3.4), Chapter 5 (see 
page 261). However, the intuitive reason is simple: If y = x'/” and x > 0, 
then y” = x, and by interchanging x and y we obtain the equation x” = y. 
The latter equation defines‘a smooth curve whose slope at every point is given 


Se ee 
by the derivative = = nx"—!, Interchanging x and y amounts geometrically 
3 


to a reflection about the kine y = x. We conclude that the original curve 
y =x", x» > 0, has the same intrinsic shape and smoothness as that 
defined by y = x”, y > O. It therefore must have a tangent line at every 
point, which means that x!” is differentiable. 


EXAMPLE 3. If y = x!/”, then 


dy es fi 29 ae | 
deri” ~ xi—iin ee 
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EXAMPLE 4. Find the derivative of the function F(x) = (3x? + 5x + 1)°/, 
If we let f(x) = 3x° + 5x + 1, then Theorem (8.2) implies that 


Py = sia) t's) 
= (3x? + 5x + 1)7/8(6x + 5). 


d ieee , 
With the FE notation for the derivative, the Chain Rule can be written 
x 


in a form that is impossible to forget. Let f and g be two differentiable func- 
tions. The formation of the composite function f(g) is suggested by writing 
u = g(x)and y = f(u). Thus x is transformed by g into u, and the resulting 
u is then transformed by f into y = f(u) = f(g(x)). We have 


du : 

a g(x), : 
a 

du iS St’), 

dy 


? lf (g(x))J’. 


By the Chain Rule, [f(g(x))I' = f’(g(x))g'(x) = f’(wg’(a), and so 


C=) 


The idea that one can simply cancel out du in (4) is very appealing and ac- 
counts for the popularity of the notation. It is important to realize that the 
cancellation is valid because the Chain Rule is true, and not vice versa. Thus 
far, du is simply a part of the notation for the derivative and means nothing 
by itself. Note also that (4) is incomplete in the sense that it does not say 
explicitly at what points to evaluate the derivatives. We can add this informa- 
tion by writing 


& (a) = & (ula) & (a. 
: l dw 
EXAMPEE O. Tt Ww 42 S22 4) 3 and 2 ==) find: =- (QQ). By the Chain 
Rule, ks dx 
dw _ dw dz 
dx dz dx 


= (2z + 2) (- 1). 
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When x = 2, we have z = §. Hence 


dw ie eee 
 Q) = Q-$+2(-b = -2 


EXAMPLE 6. Two functions, which we shall define in Chapter 11, are the 
hyperbolic sine and the hyperbolic cosine, denoted by sinh x and cosh x 
respectively. These functions are differentiable and have the interesting 
property that 


% sinh: x == cosh x, 
dx 
a cosh x. — sui. 
dx 


Furthermore, sinh (0) = 0 and cosh (0) = 1. Compute the derivatives at 
x= Oof 


(a) (cosh x)’, 


(b) the composite function sinh (sinh x). 


By (8.2), we obtain for (a) 


d ae. ad s 
ro (cosh XY = 2 Cosh xX ee Caen x1= 2 cosh x sinh x, 


and so 


(cosh x)"(0) = 2cosh Osinh 0 = 0, 


Part (b) requires the full force of the Chain Rule: Setting u = sinh x, we 
obtain 


a a 4 du 
Fy Sh wu = 7 inh u 
= cosh u cosh x, 
or 
ee sinh (sinh x) = cosh (sinh x) cosh x. 
Hence 


Bc sinh (sinh x)(0) 


= cosh (sinh 0) cosh 0 


= ¢ostYO cosa0 = i 
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PROBLEMS 
1. In each of the following problems find [f(g)]'(x). 
(a) f(y) = y’ and g(x) = x + 1. 
(b) f(y) = y + 2yand g(x) = x° — 2x +2. 
(c) f(y) = y’ and g(x) = 


+ 


and g(x) = x’. 


u 


(dq) fw = 


u 1 


(e) f(x) =x and g(x) = x”? 
ae it 
e+ 
(g) f(x) = g(x) = x + 3x42. E 
2. Find f’ given that 


(f) f(x) = x and g(t) = 


(a) f(x) =(+x°)" Sa 
‘ Ba 2 Vs? +1 
(b) ftir Sx ++ 2x>4- x + 4) 
(c) fi) =@ + 1°? - 3)’ (ORE: Otter reer 
(d) f(x) = Vx? - 1 P 
1 3 (h) f(u) = rey : 
a, 
(©) f(x) = (2 7 ') (« + +) 


1/2 
x! 


4/ 5x3? + 6x2 + 4x 
composite function f(g) in two ways: 


3. if f@) = yo and g(x) = > compute the derivative of the 


(a) By finding f(g(x) ) first and then taking its derivative. 
(b) By the Chain Rule. 


4. 2 = Sy ay ae a | and y = 2x” — 6, dig wad Sy 
dx dx 
5. ify = x’ and x = > compute ay and ay (2) using the Chain Rule. 
Vr+5 dt dt 
2 t— 1 dy 
6. Lety = x +3x+2andx = eye Compute % (2) in two ways: 


(a) By evaluating the composite function y(f) and then by taking its derivative. 
(b) By the Chain Rule. 


7. Prove directly by induction on n without using the Chain Rule that if fis a 
differentiable function and n is a positive integer, then (f”)’ = nf”—1f’. 
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8. Prove as a corollary of the Chain Rule that 


[MeV = f'(g@)e' Mi. 


9. Using Problem 8, show that if w = f(z) and z = g(y) and y = A(x), then 


dw _ dw dz dy | 
dx dz dy ax 
1 pias dw 
10. Letw=z—-—-,z=%/y3+ 1, andy = 2x3 —x+1. oe 
YE, X 


11. Using Example 6, compute 
2 


(a) Fea x) (c) LEE Se (0) 
dx dx2 
(b) a sinh x (d) = sinh(cosh x) 
dx? dx 
12. i z= f/oy and y = 6d, show tea dy a dy 
Se cee es dx dy? \dx dy dx 
i. 2 = 2y° — 3y+1landy = x= 1, compute 7 (2) in two ways: 


(a) By evaluating the composite function z(x) and finding z’’(2). 


(b) Using the result of Problem 12. 


1 
14. Let f(x) be a differentiable function with the property that f’(x) = —. If g(x) 
Xx 


is a differentiable function with the property that its composition with f is the 
identity function, i.e., f(g(x)) = x, prove that g’ = g. 


9. Implicit Differentiation. The subset C of the xy-plane consisting of all 
ordered pairs (x, y) that satisfy the equation 


ys | 
ae (ia 


is the hyperbola shown in Figure 37. It is apparent from the figure that the 
whole set C is not a function, since it is easy to find instances of ordered pairs 
(a, b) and (a, c)in C with b ¥ c. For example, both (6, 2\/3) and (6, — 2/3) 
lie on the curve. On the other hand, many subsets of C are functions. For 
instance, the set of all ordered pairs (x, y) in C for which x > 3 and y > 0, 
which is drawn with a heavy curve in Figure 37, is a function f(x). Central to 
the ideas that follow is the fact that since the points (x, f(x)) that comprise 
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J belong to C, they satisfy the equation of the hyperbola. That is, 
a OD. 
i. > 


: 1, es. 


for every x > 3. We say that the function fis defined implicitly by (1). 

It is geometrically obvious that the hyperbola has a tangent line at every 
point, and we therefore conclude that the function f(x) is differentiable 
except at x = 3, where the tangent is vertical. We can compute f’(x) most 
easily by observing that since (2) holds for every x in the domain of f, it is an 


y-axis 


Gao 


x-axis 


(6, —2/3) 


Figure 37 


equality between two functions. Specifically, the composite function 
ies 
9 4 

equal derivatives. Hence 


is equal to the constant function 1. Equal functions have 


9 ~ 


u —_— ss 


The rules of differentiation yield 
2 2 eae 
me. gi) =», 


and solving for f’(x), we obtain 


iy sete 
PO) = oF) 3) 
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In particular, if x = 6, then f(x) = 2\/3, and 


2078 be 8 
9-2/3 373 
It is important to realize that there is no single function f defined 
implicitly by equation (1). The set of all points (x, y) of C for which y < 0 
is another such function, and it includes the point (6, —2./3). Note that if 


this were the function that we denoted by /, we would still obtain equations 
(2) and (3). For this f, however, we have f(6) = —2,/3. Hence, this time, 


FO = 


: 4-6 4 

£80 aS 
9(— 2/3) 373 

EXAMPLE |. The set of all points (x, y) that satisfy the equation 


5x? — 6xy + Sy? = 8 (4) 


can be shown to be the ellipse shown in Figure 38. What is the slope of the 
line tangent to the ellipse at (0, 2\/2)? It is clear from the figure that the set 


Figure 38 


of all pairs (x, y) on the ellipse for which y > 0 and y > x (drawn with a 
heavy curve in the figure) is a differentiable function f(x). This function is 
implicitly defined by equation (4). Thus 


5x? — 6xf(x) + 5(f(0))? = 8, 


for every x in the domain of f. Since this is an equality between two functions 
we obtain by differentiating both sides, 


10x — 6f(x) — 6xf’(x) + 10f(x)f’(x) = 0. 
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Solving for f’(x), we get 


ag As SX. 

Sf(x) — 3x 

This problem deals with an implicitly defined function whose graph passes 
through the point (0, 2\/2). Hence f(0) = 2\/2 and therefore f’(0) = 3, 
which is the slope of the desired tangent line. 


f(x) = 


The definition, which we have thus far illustrated with two equations, is 
the following: A function f(x) is defined implicitly by an equation F(x, y) = c, 
where c is a constant, if F(x, f(x)) = c for every x in the domain of f. We 
emphasize that, in general, an equation in x and y defines y as a function of x 
in many ways. The most we can hope for in the way of uniqueness is that, for 
a given point (a, b) such that F(a, b) = c, we can choose an open interval 
containing a which is the domain of precisely one continuous function f(x) 
defined implicitly by F(x, y) = c with f(a) = b. 

Note that in both our examples the derivative f’ of the implicitly defined 
function was computed without solving the original equation for f. The fact 
that this is always possible is almost too good to be true—especially for an 
equation where first solving for y in terms of x is either impractical or even 
impossible (except by numerical techniques). This method of finding the 
derivative of an implicitly defined function by differentiating both sides of the 
equation that defines the function is called implicit differentiation. 


EXAMPLE 2. The point (2, 1) lies on the curve defined by the equation 
x*y + xy? = 10. 


Assuming that this equation implicitly defines a differentiable function f(x) 
whose graph passes through (2, 1), compute f’(2). Letting y stand for f(x), 
we obtain by implicit differentiation 


2 dy | 3 2dy _ 
eres ty a. 


Hence 
dy 3xyty 
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EXAMPLE 3. The set of all pairs (x, y) that satisfy the equation 
yi + yx? + ax? — 3ay? = 0 (5) 


—— 5 
is the curve, called a trisectrix, shown in Figure 39. Find = when x = a. 
x 


So stated, the problem is impossible. There are three distinct points on the 


y-axis 
trisectrix 
yi+yx?+ax?—3ay*=0 | 
| 
(a, (1 +V2)a) 
(a, a) 
| 
| 2a 
a Pel (a, 1 —V2)a) er 
ig 
| 
y=-a 
Figure 39 


curve with x-coordinate equal to a, which may be found by substituting 
a = x in equation (5) and then solving for y. The points are (a, a), 
(a, atv 2)a), and (a, ad — \/2)a). As shown in the figure, we may select 
a small interval about a to serve as the domain of three different implicitly 
defined functions. To find the derivative of each one at x = a, we proceed 
by implicit differentiation: 
ay V4 D4 dxy + Dax — bay & = 0. 

Hence 

yy Dy eae 

dx ~ bay — x2 — 38 


Thus the derivatives at a of the three differentiable functions defined implicitly 
by equation (5) are, respectively, 
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dy == eee 

a 2 
y=a(1+ v2) 

dy = e) 

i aoe 
y=a(1— V2) 


The reader should note that in each of the above examples of implicit 
differentiation the existence of an implicitly defined differentiable function 
has either been assumed outright or justified geometrically from a picture. 
The problem of giving analytic conditions which ensure that an equation 
F(x, y) = c implicitly defines y as a differentiable function of x in the neigh- 
borhood of a point (a, 5) is the subject of the Implicit Function Theorem. 
A discussion and proof of this famous theorem may be found in any standard 
text in advanced calculus. 


PROBLEMS 

1. The equation (x? + y’)? = 2(x? — y?) (see Figure 40) implicitly defines a 
ee 

differentiable function f(x) whose graph passes through the point (S ; 5) ; 


2 
PR 
Compute f’ (2) ; 


y-axis 


lemniscate 
Oe yy = 2x? = y*) 


Figure 40 


2. Compute the slope of the tangent line to the circle x2 + y? = 4 at the point 
(1, 3) and at the point (1, —V/3). 
3. (a) The equation x* + y* — 6xy = 0 (see Figure 41) implicitly defines a 
differentiable function f(x) whose graph passes through (3, 3). Compute 
J). 
(b) How many differentiable functions f(x) having a small interval about the 
number 3 as a common domain are implicitly defined by the equation in (a)? 


(c) Compute /’(3) for each of them. 
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y-axis 


(3,3) 


folium of Descartes 


d 
4. For each of the following equations calculate = at the point specified. 


(a) 
(b) 
(c) 


(d) 


(e) 


(f) 
(g) 


(h) 
(1) 
(J) 


xi+y?-—6xy=0 


— 


Figure 41 


x 
4x? + y? = 8, at the point (1, 2). 


y* = x, at the pom (4) 2). 
y? = x°, at one point (1, 1). 
2 | 
y= — - at the point (a, d). 
x al 
y = ea > at the point (a, b). 


x?y + xy? = 6, at the point (1, 2). 
x* + 2xy = 3y’, at the pome(t, B. 


2 
Sy? = x*y + —,» at the point (2, 1). 
XY 


x32 4 y3/2 — 2 at the point (1, 1). 
x? + 3x2y3 + 3x3y2 + y®> = 8, at the point (1, 1). 


X-axis 


5. What is the slope of the line tangent to the graph of y*x? = 4 at the point (2, 1)? 
Calculate y’’(2). 
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6. Each of the following equations implicitly defines y as a differentiable function 
of x in the vicinity of the point (a, 6). Compute “ (a) and < (a). 
(a) x? —y2? =1, GBS) = (V2, 1). 
(b) y? =1—xy, (@,b) = (0,1). 
(c) xy? = 8, “G@pe) = 42 =D). 
(d) x’y® = 1, (a;b)-=-(—4,)). 


CHAPTER 2 


Applications of the Derivative 


1. Curve Sketching. The slope of the tangent line to the graph of a function 
is one interpretation of the derivative, and the rate of change of y with respect 
to x is another. Both interpretations aid us in the sketching of graphs. A 
little practice will show that we need plot relatively few points for a sketch 
if we know the slope of the graph at each of these points. Let us consider the 
function f defined by 


f(x) = 4x3 — 4x? + 12x — 5. 
3 


Its domain is R, and, for each real value of x, we find the corresponding 
value f(x). To help us make the sketch, we look at the derivative: 


f'() = x7 — 8x + 12 = & — 2)& — 86). 


If x < 2, each of the factors of f’(x) is negative, and hence their product 
is positive. Thus the first derivative is positive for each value of x less than 2. 
With the rate-of-change interpretation, this means that the rate of change of f 
with respect to x is positive or that f(x) increases whenever x does. Thus, as x 
increases from — « to 2, f(x) increases. The graph goes up as one moves to 
the right until x = 2. 

If x = 2, then f’(x) = 0, and the tangent, having a slope of 0, is hori- 
zontal. If 2< x < 6, the first factor of f’(x) is positive, the second 
factor is negative, and their product is negative. With a negative rate of 
change, f(x) must decrease as x increases. Thus, as x increases from 2 to 6, 
f(x) decreases. The graph goes down as one moves to the right from x = 2 
tox = 6. 

If x = 6, then f’(x) = 0, and the tangent to the graph is again horizontal. 

If x > 6, both factors of f’(x) are positive, and hence their product is 
positive. Thus f(x) increases as one goes to the right beyond x = 6. Since 
f2Q=4-°8-—4-44 12-2 —5 = and f6) =7° 20 ee 
12-6 —5 = —5, we plot the points (2, 52) and (6, —5). At each of these 
points we sketch a horizontal line segment. An additional point may be found 
by inspection: Since f(0) = —5, the point (0, —5) is also plotted on Figure 1. 

We now know that the graph comes up from lower left through (0, —5) 
to (2, 52), goes down from (2, 52) to (6, —5), and then goes up to the right 
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from (6, —5). But we do not know its shape. Further information on this 
may be obtained from the second derivative: 


oe 8 — 2x — Aj. 


ee 


Figure 1 


If x < 4, then f’’(x) is negative. Since f’’ is the rate of change of f’ with 
respect to x, this means that f’ is decreasing as x is increasing from — @ to 4. 
If we interpret f’ as the slope of the tangent, then this means that the slope 
of the tangent decreases as x increases. We can get some idea of shape here 
if we plot three points on Figure 2(a), the middle one the highest and with a 
horizontal tangent drawn through it. Note that the tangent through the 
middle point has slope less than that of the tangent through the left point 


y-axis 


Figure 2 
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and that the tangent through the right point has a slope which is still less. 
The slopes of tangents at intermediate points will take on intermediate values, 
and thus a curve passing through these three points with these three tangents 
must be concave downward or must ‘‘bend’”? down. Whenever f’’(x) < 0, 
the graph of f(x) will be bending down. The part of the curve through the 
three points of Figure 2(a) with the appropriate tangents is drawn in 
Figure 2(b). 

If x = 4, then f’’(x) = 0, and the rate of change of f’ with respect to x is 
0. Thus the slope of the tangent has ceased decreasing. 

Ifx > 4, then f’’(x) > 0, and/’ increases as x increases. Thus the slope 
of the tangent increases as x increases. Again we plot three points, the middle 
one the lowest and with a horizontal tangent drawn through it. These are 
shown in Figure 3(a). Since the slope increases as x increases, the tangent 


y-axis 


Figure 3 


through the middle point has slope greater than that of the tangent through 
the left point, and the tangent through the right point has a slope which is 
still greater. The slopes of tangents at intermediate points take on inter- 
mediate values, and thus a curve passing through these three points with 
these tangents must be concave upward or must “‘bend” up. Whenever 
f''(x) > 0, the graph of f(x) will be bending up. The part of the curve through 
the three points of Figure 3(a) with the appropriate tangents is drawn in 
Figure 3(b). 

After finding that f(4) = 4:64 —4:-16+4+ 12:4 -— 5 = 4, we are 
ready to sketch the graph in Figure 4. The graph is concave downward 
from the far left through (0, —5) to a high point at (2, 54) and on to (4, 4). 
It is then concave upward from (4, 4) to a low point at (6, —5) and on upward 
to the right. The graph is, of course, incomplete, since it continues indefinitely 
both downward to the left and upward to the right. The point (2, 52), being 
higher than any nearby point on the graph, is called a local, or relative, 
maximum point. It is certainly not the highest point on the graph, hence the 
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word “‘local,”’ or “‘relative.’’ Similarly, the point (6, —5) is a local, or relative, 
minimum. 


y-axis 


f(x) = 4e?-—4x74+ 12x—-—5 
@,99) 


X-axis 


(6, =>) 


Figure 4 


In summary, the graph of a function is concave downward when the 
second derivative of the function is negative and concave upward when the 
second derivative of the function is positive. The graph has horizontal tan- 
gents when the first derivative is 0. The points where the tangent is horizontal 
may be local maximum points or local minimum points or, as we shall see in 
Example 1, points of horizontal inflection. 

It is important to understand clearly the definitions of the various ex- 
pressions used in sketching graphs. The ordered pair (a, f(a)) is a local 
maximum point or a local minimum point of the function f if there is an open 
interval of the x-axis containing a such that, for every number x in that 


interval, S@ZSTCO or f(a) < f(x), 


respectively. As we have indicated, the words relative maximum and relative 
minimum are also used. On the other hand, the pair (a, f(a)) is an absolute 
maximum point if, for every x in the domain of f, 


f(a) = f(), 
and an absolute minimum point if 
f(a) < f(x). 


An extreme point is one that is either a maximum or minimum point (local 
or absolute). If (a, f(a)) is an extreme point, we shall call f(a) the extreme 
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value of the function and shall say that the function has the extreme value at 
a. For example, we say that the function fin Figure 4 has a local minimum 
value of —5 which occurs at x = 6. However, this function has no absolute 
maximum or minimum points. 

Any point (a, f(a)), where f’(a) = 0, is called a critical point of /. 

A point of inflection is a point where the concavity changes sign. Thus 
(a, f(a)) is a point of inflection of the function f if there is an open interval on 
the x-axis containing a such that, for any numbers x; and x, in that interval, 


Fa) fi" (%2) < 0 Cl 


whenever x1 < a and x. > a. The inequality (1) simply says that f’’(x,) 
and f’’(x2) are of opposite sign. A characteristic of a point of inflection of a 
function is that its tangent line crosses the graph of the function at that point. 
The different possibilities are illustrated in Figure 5. Of the points P, Q, and 


y-axis y-axis y-axis 


P=(a,f(a)) Q = (a, f(4)) 


R=(a, f(a)) 


Figure 5 


R, only R is a point of inflection. If a function has a point of inflection (a, f(a)) 
and the second derivative f''(a) exists, then f'’(a) = 0. However, it is possible 
to have a point of inflection at a point where there is no second derivative 
(see Problem 10 at the end of this section). 


EXAMPLE |. Sketch the graph of f(x) = (x + 1)*(x — 1). We first compute 
the derivatives: 


f'oKy = @+ I! 4+ @ = D386 + YD = 6 Eee 
F(x) = (« + 1974 + (4x — 2)2(% +: 1) = 12x(x 4+ 1). 
Setting f’(x) = 0, we obtain x = —1 and x = 4. Thus (—1,0) and 


3, —24) are critical points and the tangents through these points are hori- 


zontal. Setting f’’(x) = 0, we get solutions x = —1 and x = 0. Thus, if 
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there are any points of inflection, they must occur at these two places. It is 
easy to see that the sign of the second derivative for values of x along the 
x-axis follows the pattern 


positive | negative | positive 


—| 0 X-axis 


We conclude that (—1, 0) and (0, —1) are in fact points of inflection. The 
point (—1, 0), being both a critical point and a point of inflection, is a point 
of horizontal inflection. The graph is shown in Figure 6. Note that the graph 


f(x) = («+ 1)*(x-1) 


ee =a) 
Figure 6 
crosses the tangent at the point of horizontal inflection and that the slope of 


the tangent line does not change sign at that point. The first derivative 
(hence the slope of the tangent) increases to zero, as we go from the left to 


x = —I1 and then decreases again through negative values, from x = —1 to 
x = 0. The point (¢, — #24), being the lowest point on the graph, is not only 
a local minimum point but also an absolute minimum point. —¥22 is the 


absolute minimum value of this function. 


In the plotting of graphs of polynomial functions, we are frequently 
helped by knowing that a straight line can cut the graph of a polynomial 
function of the nth degree in at most n points. This is a consequence of the 
algebraic fact that, if p(x) is a polynomial function of degree n, then the equa- 
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tion p(x) = 0 can have at most n distinct real roots. The function in Ex- 
ample 1 may be expanded to show that it is a polynomial of degree 4. It is 
possible to draw a straight line which will cut its graph in four points, but no 
straight line which will cut it in as many as five points. 


EXAMPLE 2. Sketch the graph of f(x) = 1 — x?/*. As before, we find 
derivatives: 

f'@) Ficus 

fx) = Bx. 
For no values of x will we have f’(x) = 0, and so there are no critical points. 
On the other hand, f’(0) is not defined and the graph has a vertical tangent 
at (0, 1). The first derivative is defined for all other values of x and is positive 
for x < O and negative for x > 0. Thus, at each point on the graph to the 
left of the vertical axis, the slope of the tangent is positive, increasing without 
limit as x — O—. At each point on the graph to the right of the vertical axis, 
the slope of the tangent is negative, increasing from negative numbers large 
in absolute value as x increases from 0. The second derivative f’’(x) is positive 
for all values of x except for x = 0, where it is not defined. It follows that 
there are no points of inflection. The graph is nowhere concave downward. 


y-axis 


Figure 7 


Note in Figure 7 that the portion of the graph which lies to the right of 
the vertical axis is the ‘“‘mirror reflection” across that axis of the portion which 
lies to the left of the vertical axis. Note also that f(— x) = f(x). Such a func- 
tion, where f(— x) = f(x), is called an even function and its graph will always 
contain two halves which can be brought into coincidence with each other by 
folding the graph along the vertical axis (Figure 8). The problem of graphing 
an even function is simplified by graphing it for positive values of x and draw- 
ing, for negative values of x, a reflection over the vertical axis of the right half 
of the graph. 
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y-axis 


even function 


Figure 8 


4 
EXAMPLE 3. Sketch the graph of f(x) = x + —. The derivatives are 
x 


mo 4, 
fia =1-< 


The first derivative vanishes for x = 2 and x = —2, and thus we see that 
(2, 4) and (—2, —4) are critical points. The second derivative is negative 
when x is negative, so the curve is bending down at (—2, —4) and that point 
must be a local maximum point. Similarly, f’’(2) > 0 and (2, 4) is a local 


y-axis 


Figure 9 
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minimum point. f(0) is undefined and | f(x)| increases as x — 0. Behavior 
for large values of |x| can be seen, since f(x) approaches x as x increases or 
decreases without bound. 

From Figure 9, one can see the graph approaching the vertical axis as 
x — 0 from either side, and also approaching the graph of the equation 
y = xas|x| increases without bound. Note that the two parts of the graph are 
reflections of each other across the origin, and also that f(—x) = —f(x). 
Any function f for which f(—x) = —/f(x) is called an odd function. The 
graph of an odd function may be obtained by first drawing the graph of 
f(x), where x > 0 (see Figure 10). We may then obtain the remainder of the 


odd function 


Figure 10 


graph by first reflecting this positive part about the y-axis and then about the 
x-axis. The result of reflecting first about one axis and then about the other 
we shall call reflection about the origin. 


Summarizing the techniques of curve sketching, we find the first and 
second derivatives of the function with respect to x, we find the points where 
either derivative vanishes, and we determine critical points and points of 
inflection. Points of general use in graphing f(x) include: 


1. The tangent is horizontal if f’(x) = 0. 


2. The curve is concave downward if f’’(x) is negative, concave upward 
if f’’(x) is positive. 

3. (a, f(a)) is a local maximum point if f’(a) = 0 and f’’(a) < 0, a 
local minimum point if f’(a) = 0 and f’’(a) > 0. 

4. (a, f(a)) is a point of inflection if f’’(x) changes sign as x increases 
through a. 
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5. Even and odd functions need be investigated carefully only for 
x > 0. The rest of the graph of an even function is found by reflection 
across the vertical axis, of an odd function by reflection about the origin. 


PROBLEMS 


1. Sketch the graph of each of the following functions, carefully labeling all 
extreme points and all points of inflection. Classify each extreme point as to 


type. 


(a) x =o eae 
(hb): 3: Se Saat (k) (a? — 2/3? 
(c) 2x°—3x-1 (1) |x| 
(4). 5 328 (m) |x — 7| ‘ 
(e) x — 3x Soe eles 
a 0) x e +3 
(g) xa 2 ee anes 

=o ip) Sessa 
ee (q) (x —1)(@ — 2) & — 3) 
(i) x —x fa) edit ered 


2. Show that a polynomial function of x which consists only of even powers of x 
is an even function. 
3. Show that a polynomial function of x which consists only of odd powers of x 
is an odd function. 
4. (a) Show that the graph of the function ax? + bx + c,a # 0, always has an 
absolute extreme point. 
(b) Which of the constants a, 6b, and c determines the type of extreme point 
of the graph? 
(c) What is the extreme value of ax? + bx + c? 


(d) Write ax? + bx +c as o( = a * x) +c, complete the square on 
a 


x? + ae without changing the function, and find the result of (c) alge- 
a 


braically. 

5. Show that the graph of x* — 12x has a local maximum point but no absolute 
maximum point and that it also has a local minimum point which is strictly 
local. 

6. Sketch the graph of f(x), if f(0) = 3 and f’(x) = —1 for all real values of x. 
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7. Sketch the graph of f(x), if f(—1) = 2 and f’(x) = 4 for all real values of x. 

8. Sketch the graph of f(x), if f(0) = 0 and f’(x) = x for all real values of x. 

9. Construct a function which has a local maximum point, with local maximum 
point defined as in this section, but would not have a local maximum if the 
definition were changed to demand f(a) > f(x), x ¥ a, instead of f(a) > f(x). 

10. Graph the function x! and show that it has a point of inflection where neither 
the first nor the second derivative exists. 


2. Maximum and Minimum Problems. In sketching the graph of a function, 
we spent some time looking for maximum and minimum points, both local 
and absolute. This idea suggests that we can use the same technique to find 
that value (or those values) of a variable which maximize or minimize a 
length, an area, or a profit. For example, what should be the dimensions of 
the rectangular field which can be enclosed with a fixed length of fencing but 
has the greatest area? Or, what are the dimensions of the quart can which 
can be made from the least amount of tin? Or, if the telephone company 
were to reduce the rate per instrument for each new instrument over a certain 
number, what number of telephones would give them the greatest profit? 

These are all problems which can be solved by calculus and, more 
specifically, by the technique developed in Section 1. However, before we 
tackle them, we shall consider the theorems which justify the methods which 
we shall use in solving them. 


(2.1) THEOREM. Jf a belongs to an open interval in the domain of f, if f'(a) 
exists, and if (a,f(a)) is a local extreme point (either a maximum or a 
minimum), then f’(a) = 0. 


Proof. Geometrically this theorem is obvious. We shall prove it, only in the case 
that (a, f(a)) is a local maximum point, since a similar proof (with the in- 
equalities reversed) is valid for a local minimum point. Since (a, f(a)) is a 
local maximum point, f(a + 1) < f(a) for all t in some open interval con- 


taining 0. Thus f(a + 1) — f(a) < 0. If ¢ is negative, fa; Ss. > 0 
and : 
lim 
t—0— 


fe+)— f@). , 
> 0. 


If ¢ is positive, 
flat Be f(@) ae 


; Sie 

lim fat) —- f@ <. 0. 

t>0+ t 

Since f’(a) exists, the two limits above must have a common value which is 
f'(a). Thus f’(a) is both greater than or equal to zero and also less than or 
equal to zero. The only number which satisfies both of these conditions is 


zero, hence f’(a) = 0. This completes the proof. 
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In our sketches we found points where the first derivative vanished. If 
the curve was concave downward at that point, we identified a local maximum 
point; if the curve was concave upward at that point, we identified a local 
minimum point. We summarize this result in the following theorem. 


(2.2) THEOREM. Let f be a function with a continuous second derivative and 
with f'(a) = 0. Then (a, f(a)) is a local maximum point if f'’(a) < 0.and is a 
local minimum point if f’’(a) > 0. 


This theorem is easily proved when we have more mathematics at our 
command. Specifically, it follows quickly from Taylor’s Formula with the 
Remainder (see Problem 13, page 540). For the work at hand, it will be 
sufficient to understand the theorem and to be able to use it. 

If the domain of fis restricted to a closed interval, we may find an abso- 
lute extreme point which lies on the boundary of the interval. Consider the 
function graphed in Figure 11. This function is defined on the closed interval 


(d, f(4)) 


(5, f(6)) 


(c, f(c)) 


Figure 11 


[a, d] and has a local minimum point at (c, f(c)). However, there are several 
points in [a, d] which are lower than (c, f(c)), and the absolute minimum 
point is (a, f(a)). Similarly, (b, f(6)) is a local maximum point but (d, f(d)) 
is the absolute maximum point. This suggests the following theorem. 


(2.3) THEOREM. Let f be a differentiable function whose domain is restricted 
to a closed interval containing a. If (a, f(a)) is an extreme point then f'(a) = 0 
or a is an endpoint of the interval. 


Proof. This theorem is an immediate corollary of Theorem (2.1). Let the domain be 
[c,d]. If a # c and a ¥ d, then a lies in the open interval (c,d) and, by 
Theorem (2.1), f’(a) = 0. If a = c or a = d, then a is an endpoint of the 
interval. 
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For many functions, Theorem (2.3) has the virtue of reducing an ap- 
parently impossible problem to a simple one. In principle, the problem of 
finding the maximum values of a function over a closed interval involves the 
examination of f(x) for every x in the interval, i.e., for an infinite number of 
points. This theorem tells us that we need look only at those values of x at 
which the first derivative vanishes and those which are endpoints. For most 
functions there is only a small finite number of such points. 

Theorem (2.3) tells us where to look for the extreme points of a differ- 
entiable function defined on a closed interval, but it does not guarantee the 
existence of any. To complete the theory, we add a statement of the following 
fundamental existence theorem. 


(2.4) THEOREM. Every real-valued continuous function whose domain is a 
closed bounded interval has at least one absolute maximum point and at least 
one absolute minimum point. 


We omit the proof. The result sounds perfectly obvious, of course, and 
it is obvious in the sense that if continuity means what we want it to mean, 
then (2.4) must be true. To see whether it, in fact, follows logically from our 
definitions, of course, requires proof. Further insight into the theorem may be 
found in Problems 21 and 22, where it is shown that functions that do not 
satisfy the hypotheses of (2.4) can fail to have absolute extreme points. 

Let us now look at the problems suggested at the beginning of the 
section. 


EXAMPLE |. What are the dimensions of the largest rectangular field which 
can be enclosed with 200 feet of fencing? Should it be long and narrow, short 
and wide, or somewhere in between? If we let x be the number of feet in the 
length, then 100 — x will represent the width, and we can write the area 4 
as a function of x: 

A(x) = x(100 — x) = 100x — x? 


(see Figure 12). Note that the domain of A is the interval consisting of all x 


100 —x 


Figure | 2 
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such that 0 < x < 100. We want to find that value of x which will give a 
maximum value of A. Taking derivatives, we obtain A’(x) = 100 — 2x 
and. A” (x) ==2._ Sets A’ equal to zero,“ we find x = 50. © Since 
A'(50) = Oand A’’(50) = —2 < 0, we know that 50 is that value of x which 
maximizes A. Thus the field, which is 50 feet long and 100 — 50 = 50 feet 
wide, is the largest rectangular field which can be enclosed with 200 feet of 
fencing. 


The problem of solving a maximum or minimum problem consists of 
setting up the function to be maximized or minimized and then taking deriv- 
atives. The theorems of this section tell how to proceed from there. 


EXAMPLE 2. What are the dimensions of the cylindrical quart can which can 
be made from the least amount of tin? This problem is important to the 
manufacturer who produces tin cans and is more concerned with the amount 
of tin used than with the shape of the can. Should he make tall cans with a 
small radius or short cans with a large radius? We shall ignore seams and 
assume that tin cans are perfect cylinders. The volume of a cylinder is 
ar°h and a quart contains 573 cubic inches. Thus ifr is the radius of the can 
in inches and h is the height in inches, mr?h = 572 or 23+ (see Figure 13). 


Figure 13 


The area is the sum of the lateral surface area and the area of the bottom and 
the top: A = 2mrh + 2rr?. The area depends on r and A, but we can use our 


204 


volume equation to find / as a function of r: h = Lp? and then write 
Tr 


231 guAngQst 2 
ALP) = 257 (74) + 2rr = ies + 2nr. 


TT. 
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The domain of A is the set of positive real numbers. Taking derivatives, we 


= 231 
haves Ae) = = ae et 4rr and A”(r) = —> + 4. Setting A’ equal to 
pat 
zero; we-have. 8#f?| =231,.0r 7 == 4/-——'= 2.10 (approximately) 
AGCZAGQ =U and AX.) > G, iM Hoi ae a minimum value to A. 
pe | 


= An(2.10) = 4.20 (approximately). The desired can will look square in 
profile, 2.10 inches in radius and 4.20 inches high. This problem could also 
have been pile by writing 4 " afunctionofh:r = Ja and then writing 
A(h) = ana) h+ 2r- re , although this area function is not as nice as 
A(r). Another method involves thinking of / as a function of r, writing A(r) 
containing both / and r, and differentiating implicity. Thus we write 


APES 219 Gig OFS Bey Eee a ae 
Differentiating with respect to r, we obtain 
dh dh 
ur’ ai + rh: 2r = 0 and ary = 2ar + 2rh + 2m - 2r. 
r 


2arh 


2 


dh 2h ae 
From the first, ties = — —. Substituting this in the second, we 
r r 


Tr 


2h 
find A’(r) = dar (— =) + 20h + 4cr = 4crr — 2rh. Setting A’(r) = 0, 
we get J 
h = 2r. 


‘ 231 
Substitution in the volume equation yields mr2(2r) = ee Sarr* = Zany 


as in the other solution. This same method could have been used with 
differentiation with respect to h. 

In each of the preceding examples, the domain of the function was an 
interval of positive real numbers. However, if a problem involves the number 
of objects which will maximize or minimize a particular function, it will 
have a domain of positive integers. In this case, we may still do the problem 
as if it were one with the entire set of real numbers as the domain of the 
function, and then consider those positive integers which lie on either side 
of the x-coordinate of the critical point of this function. 


EXAMPLE 3. A telephone company which serves a small community makes an 
annual profit of $12 per subscriber if it has 725 subscribers or fewer. They 
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decide to reduce the rate by a fixed sum for each subscriber over 725, thereby 
reducing the profit 1 cent per subscriber. Thus there will be a profit of 
$11.99 on each of 726 subscribers, $11.98 on each of 727, etc. What is the 
number of subscribers which will give them the greatest profit? 

If we let x be the number of subscribers over 725, there will then be 
725 + x subscribers and the profit per subscriber will be 1200 — x cents. 
The total profit will be P(x) = (725 + x)(1200 — x) = 870,000 + 475x — x’. 
The domain of P is the set of positive integers, but let us treat the problem as 
if the domain were R. The derivatives are P’(x) = 475 — 2x and 
P''(x) = —2. The value of x which makes P’ zero is 2374, and it also makes 
P” negative, thereby ensuring a maximum for P. We find P(237) = 926,406 
and P(238) = 926,406. Thus the profit is the same for 725 + 237 = 962 
subscribers and for 725 + 238 = 963 subscribers. If we visualize the graph 
of P, we see a parabola concave downward with its maximum point at 
(2375, 926,4064). If we delete all points which do not have integral co- 
efficients, then the points (237, 926,406) and (238, 926,406) are maximum 
points equally spaced on either side of the high point of the parabola and 
just lower than the high point. The profit of the telephone company will 
increase with each new subscriber until they have 962 subscribers. The 
addition of one more subscriber will not alter the profit, but it will then 
decrease with each new subscriber after the 963rd one. 


EXAMPLE 4. Consider the function f defined by 
x? 
2 eee —2 =x =. 4, 


The domain of f is the closed interval [—2,3]. Find the maximum and 
minimum values. The derivatives are f’(x) = x? — 2x = x(x — 2) and 
P= ee se = 1 Setting f(x) = 0, we find solutions’ x= -0 
and x = 2. By Theorem (2.3) we need evaluate f(x) only where x = 0 


and x = 2 [where f’(x) = O] and at the endpoints of the interval, x = —2 
and x=" 3; Pese values are 
Heaa-% 
POs 
I 2): 378 
ICY = 5. 


Thus the maximum value of fis %, and the minimum value is —6. Note that 
the maximum value of the function occurs at two points, one of which is a 
local maximum and the other the right endpoint of the interval. The minimum 
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value, on the other hand, does not occur at a local minimum point, but at the 
left endpoint. The graph of the function is shown in Figure 14. 


Figure 14 


We summarize the techniques for solving maximum and minimum 
problems as follows: 


1. Set up the function to be maximized or minimized. If it can easily 
be set up as a function of one variable, it should be. If it cannot easily be set 
up as a function of one variable, it should be accompanied by an equation 
relating the two variables. 

2. Take first and second derivatives of the function. Set the first deriv- 
ative equal to zero and solve the resulting equation. Evaluate the second 
derivative at those values of x for which the first derivative equals zero. 

3. If the function is a “‘nice”’ function defined on an open interval, its 
maximum will occur where f’(x) = 0 and f’’(x) < 0 and its minimum will 
oceut where f(x) = O and 7") > ©: 

4. If the function has a closed interval for its domain, evaluate the 
function at both endpoints to see if maximim or minimum values occur there. 

5. If the function has a domain restricted to integers, evaluate the func- 
tion at integers near the values which give a maximum or a minimum value 
to the continuous function. 
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Sometimes the geometric or physical properties of the problem make it 
obvious that a critical point is of the desired type, i.e., a maximum or a mini- 
mum. If this happens, it is not necessary to compute the second derivative, 
although it may still be used as a check. The complete behavior of the func- 
tion whose extreme points are desired can always be found by carefully 
plotting its graph. 


PROBLEMS 
1. Generalize on Example 1 to show that the largest rectangle with a fixed perim- 
eter p is a square with side i . 


2. A field is bounded on one side by a stone wall. A rectangular plot of ground is 
to be fenced off, using the stone wall as one boundary and 200 yards of fencing 
for the other three sides. What are the dimensions of the largest such plot? 

3. Find the positive number which is such that the sum of the number and its 
reciprocal is a minimum. 

4. List all local extreme points and all absolute extreme points for each of the 
following functions, noting carefully its domain of definition. Classify each 
extreme point by type. 


(a) 3x° — 5x? + 7; domain: all real numbers. 


(b) 4x? + 3x? — 6x + 5; domain: all real numbers. 
2 

(c) x+ al ; domain: all nonzero real numbers. 
x 


(d) 2x? — 21x? + 60x — 25; domain: all nonnegative real numbers. 


2 


(e) as ; domain: all real numbers except 1. 
XxX — 


(f) 3x* — 20x* — 36x? + 54; domain: all nonpositive real numbers. 

(g) (x — 1)?(« + 1)’; domain: all nonnegative real numbers no greater than 2. 
(h) 2 — (x + 4)?8; domain: all real numbers. 

(i) (x — 1)?(x — 4); domain: all nonnegative real numbers. 


5. Generalize on Example 2 to show that the right circular cylinder with a fixed 
volume and the least total surface area has its diameter equal to its height. 

6. Show that f(x) = x* has an extreme point where the second derivative is 
neither positive nor negative. What type of extreme point is it? Explain why 
this is not a contradiction of Theorem (2.2). 

7. Aline has positive intercepts on both axes and their sum is 8. Write an equation 


of the line if it cuts off in the first quadrant a triangle with area as large as pos- 
sible. 
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8. 


9. 


10. 


+. 


i. 


13. 


14. 


ES 


16. 


17. 


18. 


19. 


20. 
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Find two nonnegative numbers, x and y, such that x + y = 6 and x’y is as 

large as possible. 

Find all ordered pairs, (x, y), such that xy = 9 and \/x?2 + y? is a minimum. 

Interpret your result geometrically. 

(a) Graph the set of ordered pairs (x, y) such that 4x? + y? = 8. The graph 
is called an ellipse. 


(b) Find all ordered pairs (x, y), such that 4x? + y? = 8 and 4xy is a maxi- 
mum. 


(c) Find the dimensions of the largest (in area) rectangle which has sides 
parallel to the x-axis and the y-axis and is inscribed in the ellipse of (a). 


Find the dimensions of the largest rectangle which has its upper two vertices 

on the x-axis and the other two on the graph of y = x? — 27. | 

Find the dimensions of the rectangle which has its upper two vertices on the 

x-axis and the other two on the graph of y = x? — 27 and which has maximum 

perimeter. 

A box without a top is to be made by cutting equal squares from the corners 

of a rectangular piece of tin 30 inches by 48 inches and bending up the sides. 

What size should the squares be if the volume of the box is to be a maximum? 

[Hint: If x is the side of a square, V(x) = x(30 — 2x)(48 — 2x).] 

(a) A box without a top is to be made by cutting equal squares from the cor- 
ners of a square piece of tin, 18 inches on a side, and bending up the sides. 
How large should the squares be if the volume of the box is to be as large 
as possible? 

(b) Generalize (a) to the largest open-topped box which can be made from a 
square piece of tin, s inches on a side. 

(a) Where should a wire 20 inches long be cut if one piece is to be bent into a 
circle, the other piece is to be bent into a square, and the two plane figures 
are to have areas the sum of which is a maximum ? 

(b) Where should the cut be if the sum of areas is to be a minimum ? 

A man in a canoe is 6 miles from the nearest point of the shore of the lake. 

The shoreline is approximately a straight line and the man wants to reach a 

point on the shore 5 miles from the nearest point. If his rate of paddling is 4 

miles per hour and he can run 5 miles per hour along the shore, where should he 

land to reach his destination in the shortest possible time? 

Prove that the largest isosceles triangle which can be inscribed in a given circle 

is also equilateral. 

Prove that the smallest isosceles triangle which can be circumscribed about a 

given circle is also equilateral. 

What is the smallest positive number that can be written as the sum of two 


Es 1 
positive numbers x and y so that - + —- = 1? 
x bg 


An excursion train is to be run for a lodge outing. The railroad company sets 
the rate at $10 per person if less than 200 tickets are sold. They agree to lower 
the rate per person by 2 cents for each ticket sold above the 200 mark, but the 
train will only hold 450 people. What number of tickets will give the company 
the greatest income? 
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21. Consider the continuous real-valued function f(x) = x with domain0O < x < 1. 
Does this function have an absolute maximum point or an absolute minimum 
point? Why is this function not a counterexample to Theorem (2.4)? 

22: Let 


1 . adie yy oe 1. 
f(x) = )x =f) ef 
0, for y= H 


This real-valued function is defined on the closed interval [—1,1]. Draw the 
graph of f(x) and explain why this function has neither absolute maximum nor 
absolute minimum points. 


3. Rates of Change with respect to Time. The values of many physical 
quantities depend on time and change with time. In a mathematical formula- 
tion such a quantity is usually denoted by a variable which is a function of 
time. In this section we are concerned with the instantaneous rates of change 
of time-dependent variables. Let u be a real-valued function of a real variable 
t, where we identify ¢ with time. The rate of change of u with respect to time 
at a given instant t = acan be determined by considering the derivative of u. 
We have already observed in Chapter 1 that the derivative f’(a) of a function 
fata particular number a is the rate of change of the value f(x) of the function 
f with respect to x ata. It follows that the instantaneous rate of change of u 
with respect to time, when ¢ = a, is equal to the derivative: 


ula + d) = u(a)_ 
d 
In a physical application the variable u might denote the number of 
gallons of water in a tank at time ¢, where f-is measured in minutes. Then, 
du 


dt 


d ; 
u'(a) = F (a) = lim 


(a) is equal to the rate at which water is flowing in or out of the tank at 


time t = a and is measured in gallons per minute. If (a) 1s positive, then 
the quantity of water in the tank is increasing when t = a and water 1s flow- 
ing into the tank. On the other hand, if (a) is negative, then the amount 
of water is decreasing at that moment and water is draining out. Finally, if 
- (a) = O, then the amount is not changing at ¢ = a. 


An important example of rate of change with respect to time is velocity. 
For example, consider a car in motion on a straight road. To formulate 
the situation mathematically, we identify the road with a real number line, 
the car with a point on the line, and the location of the car at time ¢ with the 
coordinate s(t) of the point on the line. Thus, s is a real-valued function of 
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the real variable t. The average velocity during the time interval from t = a 
tot = a+ dis equal to the change in position divided by the change in time. 
Denoting this quantity by vz,, we have 

1a + @) aia 


Vay d 


If we graph s(t) on a time-position graph, as in Figure 15, we see that the 
average velocity is the slope of the line segment connecting the point (a, s(a)) 


s(t) 


(a+d,s(a+d)) 


(a, s(a) ) 


Figure 15 


to the point (a+ d,s(a+d)). If, keeping a fixed, we consider average 
velocities over successively shorter and shorter intervals of time, we obtain 
values nearer and nearer to the rate of change of s at t = a. We take this 
limit as d approaches zero as the definition of velocity at a and use the 
symbol v(a) for it. Thus 


eas) <5! Ga) 


o(@) )=- lim vay hm 
d->0 d—0 
Hence velocity is the derivative of position with respect to time, and we 
write v(t) = s’(t), or simply v = s’. Geometrically, the velocity at a is the 
slope of the tangent line to the graph of the function s at the point (a, s(a)). 
For a particle moving on a real number line, a positive value of v(t) 
means that the motion at time ¢ is in the direction of increasing numerical 
values, which is called the positive direction (i.e., if the line is the x-axis, 
then the particle is moving to the right). If v(7) is negative, then the particle 
Is moving in the opposite direction. Finally, zero velocity indicates that the 
particle is at rest. The speed at time ¢ is defined to be the absolute value 
|v(t)| of the velocity. Obviously, the speed measures how fast the particle is 
moving without regard to its direction. 
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Velocity depends on time, and its rate of change with respect to time will 
tell us even more about the motion of a particle. This rate of change is called 
acceleration and is defined to be the derivative of velocity with respect to time. 
Denoted by a, it is the second derivative of position with respect to time: 


iG) ete s Gr). 


The acceleration is the rate of change of velocity with respect to time. 
It may be positive, indicating that the velocity is increasing; zero, telling us 
that the velocity is constant; or negative, indicating that the velocity is 
decreasing. For motion of a particle on a horizontal number line (or the 
x-axis) we have several possibilities. If the velocity is positive and the ac- 
celeration is positive, the motion ts to the right and the speed of the particle 
is increasing. If the velocity is positive and the acceleration is negative, the 
motion is still to the right but the particle is slowing down. If the acceleration 
is zero, the velocity is momentarily not changing. If the velocity is negative, 
the particle is moving to the left and it is slowing down or speeding up, 
depending on whether the acceleration is, respectively, positive or negative. 


EXAMPLE |. A particle moves on the x-axis and its coordinate, as a function 
of time, is given by x(t) = 2t° — 21t? + 60t — 14, where ¢ is measured in 
seconds. Describe its motion. We first take derivatives to find velocity and 
acceleration: v(t) = 6f? — 42+ 60 = 6( — 7#+ 10) = 6(t — 2)(¢ — 5) 
and a(t) = 12t — 42 = 6(2t — 7). At zero time the particle is atx = —14, 
moving to the right with a velocity of 60 units per second. At that moment, 
acceleration is —42, and the particle is slowing down. At time t = 2, the 
particle is at rest (v = O) at x = 38, and the acceleration is still negative: 
a = —18. For the next 14 seconds the particle moves to the left until, at 
t = 4, itis at x = 243, moving to the left with a speed of 134 units per 
second. At that moment, however, the acceleration is zero and, in the next 
instant, the velocity will begin to increase to the right and the particle begin 
to slow down. The particle continues to move to the left for the next 14 
seconds, until ¢ = 5. At that time, the particle is at rest at x = 11 and the 
acceleration is positive. From that time on, the particle will move to the 
right with ever-increasing velocity. Its motion is indicated in Figure 16. 


t=5 


mig =40 0 11 20 30 38 X-axis 


Figure 16 


106 APPLICATIONS OF THE DERIVATIVE  [cuap. 2 


EXAMPLE 2. In a first course in physics, one encounters the formula for 
straight-line motion with constant acceleration, s = sy) + ut + gat’, 
where s is the distance from some fixed point, so the initial distance, vo the 
initial velocity, and a the acceleration. Find v and a, thereby verifying another 
formula which usually accompanies the distance formula and also verifying 
that the acceleration is constant. Taking derivatives with respect to time, we 
obtain v = s’ = Uo + Sa(2t) = vo + at anda = v’ = a. Thus we see that 
the derivative definitions do produce the familiar formulas. 


If a particle is constrained to move in the xy-plane on a circle of radius 5, 
then the point where it is at any time has coordinates which satisfy the 
equation x” + y? = 25. Each of the coordinates, however, is a function of 
time and we may write [x(‘)]}’ + [y@} = 25. Here we have an equation 
which states that two functions of ¢, [x(t)]’ + [y()]’ and the constant func- 
tion 25, are equal to each other. If the two functions are equal, they will 
change with respect to ¢ at the same rate. Taking derivatives to find the com- 
mon rate of change, we have 


£@ +y)= £25, 
which implies 
th ss + 2y 2 a= {), 

We interpret “ as the rate of change of the abscissa of the particle with 
respect to time, or as the velocity in the horizontal direction. Similarly, we 
interpret ay as the velocity in the vertical direction. We use the symbols v, 
for Fe and v, for . Another interpretation of v, and v, is that they are 
horizontal and vertical components, respectively, of the velocity of the par- 
ticle. Using this notation, we write xv, + yu, = Oorv, = — =i These 


equations relate two rates of change, and problems of this type are called 
related rate problems. 


EXAMPLE 3. A particle moves on the circle with equation x? + y’ = 10. 
As it passes through the point (—1, —3) the horizontal component of its 
velocity is 6 units per second. Find the vertical component. We first take 


—= d d 
derivatives with respect to time, (x? + yp?) = . 10, to get 2xv, + 2yv, = 0. 
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We are given that v, = 6 when x = —1 and y = —3. Substituting these 
values in the last equation, we have 2(—1)(6) + 2(—3)v, = 0. Hence 
—12 — 6v, = 0, or v, = —2. The vertical component of velocity is —2 


units per second, indicating that the motion is, at that moment, downward 
and to the right, since v; 1s given positive. It is, of course, obvious that a 
particle which is constrained to move on the circle must be moving down- 
ward if it is moving to the right in the third quadrant. 


EXAMPLE 4. A spherical balloon is being blown up, and its volume is in- 
creasing at a rate of 4 cubic inches per second. At what rate is its radius 
increasing? The volume of a sphere is given by the equation, V = 4rr’. 
Since V and 4rr° are both functions of ¢, their derivatives with respect to f 


qv. 4 d | V 
are equal. Thus geek wae Bet har 7 Replacing oe by 4 and 
dr dr l regret 
solving for aw have eee The rate at which the radius is increasing 
7 


is not constant, but depends on the radius at a particular moment. When the 
radius is 2 inches, it is increasing a inches per second; when it is 5 inches, 
T 


it is Increasing 55, inches per second, etc. 
T 


Most related rate problems are solved by first finding an equation relat- 
ing the variables. Then we may take derivatives to find an equation relating 
their rates of change with respect to time. Finally, we substitute those simul- 
taneous values of the variables and rates which are given to us in the problem. 


PROBLEMS 


1. A ball is thrown upward with an initial velocity of 80 feet per second. Its 
distance above the ground f seconds later is given by s = 80¢ — 167. 


(a) Show that it reaches its highest point when it has zero velocity. 
(b) Show that its acceleration is constant. 

(c) For how many seconds is it going up? 

(d) How high does it go? 

(e) Show that it strikes the ground with a speed of 80 feet per second. 


2. A particle moves on the x-axis and its position ¢ seconds after it starts is given 
by x(t) = 42° — 427? + 135t — 100. Describe its motion. 
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. A particle moves on the y-axis and its position ¢ seconds after it starts is given 


by y(t) = 144¢ — 288 — 1677. Describe its motion. 


. A particle moves on the x-axis and its position ¢ seconds after it starts is given 


by x(t) = 1? — 34 + 5. Describe its motion. 


. A particle moves on the y-axis and its position ¢ seconds after it starts is given 


by y(t) = 34° — 9t + 10. Describe its motion. 


. A particle moves on the ellipse with equation 4x2 + 9y? = 36. When it is 


passing through the point (—3,0), what is the horizontal component of its 
velocity ? 


U, 
. A particle moves on the circle with equation x? + y? = 16. Show that — at 


Vy 
(a, b) is equal to the slope of the tangent to the circle at (a, b). 


. Two ships leave the same dock at noon, one traveling due north at 15 knots 


(nautical miles per hour) and the other due east at 20 knots. At what rate is the 
distance between them increasing at 2 p.M.? 


. Water is pouring into a conical funnel and, although it is also running out of the 


bottom, the amount of water in the funnel is increasing at the rate of 3 cubic 

inches per minute. If the conical part of the funnel is 5 inches deep and the 

mouth of the funnel is 6 inches in diameter, how fast is the water rising when it is 

2 inches deep? 

(a) Show that, at any instant, the ratio of the rate at which the area of a circle 
is changing to the rate at which the radius is changing is equal to the 
circumference of the circle. 

(b) Show that, at any instant, the ratio of the rate at which the volume of a 
sphere is changing to the rate at which the radius is changing is equal to the 
surface area of the sphere. 

A man 6 feet tall walks away from a lamppost 15 feet tall at a rate of 4 miles per 

hour. How fast is his shadow lengthening when he is 12 feet from the pole? 

How fast is the distance from the foot of the lamppost to the tip of his 

shadow lengthening? 

At 3 p.M. a ship which is sailing due south at 12 knots is 5 miles west of a west- 

bound ship which is making 16 knots. 

(a) At what rate is the distance between the ships changing at 3 p.M.? 

(b) At what time does the distance between the ships stop decreasing and start 
increasing ? 

(c) What is the shortest distance between the ships? 

A ladder 20 feet long leans against a vertical wall. The bottom of the ladder 

slides away from the wall at a constant rate of 1 foot per second. At what rate 

is the top coming down the wall when it is 12 feet from the ground ? 

A particle moves on the parabola with equation y = x?. The horizontal com- 

ponent of the velocity at each point is equal to twice the abscissa of the point. 

Show that the vertical component of the velocity at each point is equal to four 

times the ordinate of the point. 

Sand is being poured on the ground at a rate of 4 cubic feet per minute. At each 

moment, it forms a conical pile with the height of the cone 4 of the radius of the 

base. How fast is the height of the pile rising when 217 cubic feet of sand is in 
the pile? 
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4. Approximate Values. If we can find the values of a function and its 
derivative at some particular number a, then there is a useful method for 
obtaining an approximation to the value of the function at any number near 
a. For example, knowing that \/9 = 3, we can easily obtain a good ap- 
proximation to 1/9.1. Similarly, we can use this method to find simple ap- 


1 tease 
proximations to such numbers as 103’ ~/26.8, and (32.1)'/*. 


To obtain the approximation for ./9.1, we consider the graph of the 
function f defined by f(x) = +»/x and drawn in Figure 17. The tangent line 


y-axis 


(9,3) y= x 


tangent line 


x-axis 


Figure 17 


to the curve at the point (9, 3) touches the curve at that point and is not 
very far away from it for values of x near 9. Although it is tedious to find a 
decimal approximation for the ordinate of the point on the curve y = +/x 
with an abscissa of 9.1, it is relatively easy to find one for the point on the 
tangent line with that abscissa, and the two points are not very far apart. 


lima l l ] 
Since a i/x= ie the tangent line has a slope of IT € and has an 


equation 
oe Be ee Y). 


Ifx = 9.1, y = 3+ 400.1) = 3%, or 3.017. Thus 3.017 is our approxima- 
tion for \/9.1. That it is a good approximation may be seen by checking a 
table of square roots to find \/9.1 = 3.016621. The same tangent line may 
be used to approximate 1/10, but the accuracy will not be as good. If x = 10, 
y = 3+ 2 = 3.167. The tables give 3.162278 for 10. 

The technique used in the problem above is the computation of an 
approximate value of f(x) under the assumption that the difference x — a 
is small in absolute value and that both f(a) and f’(a) are known, or can 
be easily evaluated. We write an equation of the tangent line to the graph 
of the function f at (a, f(a)) and take the ordinate of the point on the line 
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with abscissa x as the approximation to f(x). The tangent line has equation 
y —f(@ = f'(a(x — a), or, equivalently, 

V.=I@ + LOX — A. 
The function f(a) + f’(a)(x — a) is a linear function of x, and is the linear 
function which best approximates f(x) for values of x near a. The approxima- 


tion consists of simply replacing the true value f(x) by the corresponding 
value of the linear function. The result is summarized in the formula 


IQ) = f/@ + Ff @O@-—@, 7 - (1) 


in which it is assumed that |x — a] is small and the symbol ~ indicates 
approximate equality. 


I 1 
EXAMPLE |. Find an approximate value of 402° If we define f(x) = = then 
1 | 
f(4) = 4 = 0.25 is easily evaluated. Moreover, f’(4) = — i ee 


— 0.0625, and 0.02, the difference between 4.02 and 4, is small. Thus ro 


is approximately equal to 4% — 7¢(4.02 — 4) = 0.25 — (0.0625)(0.02) = 
0.24875. 


EXAMPLE 2. Compute ~/26.8 approximately. If we let f(x) = x/?, then 
fQ)) = 3. Since f’(x) = 4x7", we obtain ('(27) = 427)" = 3. The 
difference |26.8 — 27| = 0.2 is small. Thus we approximate ~/26.8 by 
3 + 37(26.8 — 27) = 34+ 3(—0.2) = 2.9926. A table of cube roots gives 
a more exact value of 2.992574, but the linear approximation gives four- 
decimal accuracy. 


An alternative point of view is obtained if we substitute x = a + fin(1). 
The left side of the formula becomes f(a + 1) and the right side is then 
f@+f'@ (a+ 1p — a) = f(a) + tf'(a). Hence, we obtain the equiv- 


alent formula 
fa+p)=f@-+ tf’@, (2) 


which gives an approximate value of f(a + 1) in terms of the known quan- 
tities f(a), f’(a), and t. The same result can also be obtained easily from the 
definition of the derivative of the function f at a, 


f(a) = tim = 3 
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It follows that if ¢ is nonzero and small in absolute value, then f’(a) is given 
approximately by 


fila) ~ a 


which immediately implies (2). 


PROBLEMS 


1. Give three-decimal approximations for each of the following numbers. 


— @) Oey ©) Gor 
(b) 64.2 (ec) 16.6 a @377" 
(c) 3/3585 (f) 16.6 (i) (0.95)°. 


2. tf f@) = =. , find an approximation for f(4.92). 
x 


_Iff(x) = Vx — 2, find an approximation for f(27.3). 
_ If f(x) = V7x2 — 3, find an approximation for f(1.9). 
5. (a) Find the volume of a sphere with a radius of 3 inches. 

(b) Find the approximate volume of a sphere with a radius of 3.1 inches. 
6. (a) Find the volume of a cube 6 inches on an edge. 

(b) Find the approximate volume of a cube 5.9 inches on an edge. 
7. Find an approximate value of 1.97\/(1.97)2 + 4. 
. Find an approximate value of the product (63.2)!/3(63.2)!”. 

9. (a) Find the area of an equilateral triangle 4 inches on a side. 
(b) Find an approximation for the area of an equilateral triangle 4.08 inches 
on a Side. 

10. The point P = (2, 1) lies on the curve defined by the equation x*y + xy? = 10. 
Find an approximation to the y-coordinate of the point on the curve near P 
with x-coordinate equal to 2.14. 


WwW 


ESS 


(oe) 


5. Rolle’s Theorem and Its Consequences. There are certain theoretical 
properties of differentiable functions which we are in a position to prove 
and which will aid us in our future work. Two of them express ideas which 
are geometrically obvious but, nevertheless, require proof. The first, due to 
the French mathematician Rolle and named for him, is illustrated in 
Figure 18. The assertion is that a differentiable function which has a graph 
crossing the x-axis at a and also at b must have on its graph at least one point 
between the crossing points where its tangent line is parallel to the x-axis. 


(5.1) ROLLE’S THEOREM. Assume that a < b and that the function f is 
continuous on the closed interval [a, b] and differentiable on the open interval 
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(a,b). If f(a) = f(b) = 0, then there exists a real number c such that 
a<c < band fi’ (ce) = 0. 


Figure 18 


Proof. If f(x) = 0 for every x in [a, 4], there is nothing to prove because f is then a 
constant function and f’(x) = 0 for every x in the interval. So we assume that 
fis not constant on [a, b]. By Theorem (2.4) a function which is continuous at 
every point of a closed bounded interval has at least one absolute maximum 
point and at least one absolute minimum point. Since f(x) does not equal 
zero for all x in the closed interval and since f(a) = f(b) = 0, the function 
f must have one of these extremes in the open interval. Let the abscissa of this 
point be c, and it follows by Theorem (2.3) that f’(c) = 0. This completes 
the proof. 


The reader should try to construct functions which do not satisfy all the 
conditions of the theorem to see why the conclusion will not then hold, and 
hence why all the conditions are essential to the theorem. One such example 
was graphed in Figure 7. 


(5,f(5)) 


| (a, f(4)) 


Figure 19 


The second theorem tells us that a function which has a smooth graph 
between (a, f(a)) and (6, f(b)) has a point in between these two where the 
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tangent to the graph is parallel to the line segment connecting these points 
(see Figure 19). Because the point lies between the other two and the tangent 
at this “‘in-between”’ point is parallel, this theorem is called the Mean Value 
Theorem. 


(5.2) MEAN VALUE THEOREM. Assume that a < 6 and that the function 
f is continuous on the closed interval [a, b] and differentiable on the open interval 
(a, b). Then there exists a real number c such thata < c < band 


f(c) = = ) 


or, equivalently, such that f(b) — f(a) = f’(c)(6 — a). 


Proof. This theorem is proved as a corollary to Rolle’s Theorem by constructing a 
function which satisfies the conditions of Rolle’s Theorem and gives the same 
result as if we had tilted the graph of Figure 19 and dropped it down. Such a 
function is 


F(x) = & — a)f() + @ — df) + © — )F@. 


Since f is continuous on the closed interval and differentiable on the open 
interval, it follows that F is, too. We also observe that 


F(a) =0+ @— df@+ 6 —- af@ 
PO) = 6 = OfO) + eb) fOr 0 


Thus the function F satisfies all the conditions of Rolle’s Theorem, and hence 
there is a real number c strictly between a and b such that F’(c) = 0. The 
derivative of F is given by 


F(x) = f6) + @—-b) fe) — F@): 


0, 
0. 


| 


It follows that 
F'(c). = f6) + — d)f'O — f@ =.9, 
which implies that f(b) — f(a) = (6 — a)f’(c). This completes the proof. 


Here again it is a good idea to try various examples to see why all the 
hypotheses of the theorem are necessary. Note that the equation which forms 
the conclusion of the Mean Value Theorem is equivalent to the one obtained 
by interchanging a and b. Thus, if b < a, the Theorem remains true with 
[a, b] and (a, b) replaced by [b, a] and (b, a), respectively, and the inequalities 
axe < breplared myn ise <a. 

One of the most important consequences of the Mean Value Theorem is 
that a function which has a zero derivative on an interval must be a constant 
function on that interval. 
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(5.3) If f’(x) = 0 for every x in an interval, then there exists a constant k 
such that f(x) = k for every x in the interval. 


Proof. Pick an arbitrary point a in the interval, and set k = f(a). Let b be any 
other point in the interval. We shall show that (4) = k also, and this will 
complete the proof. To be specific, let us assume that a < 4; an exactly 
analogous argument can be made if 6 < a. By the Mean Value Theorem 
we know that f(5) = f(a) + (6 — a)f’(© for some number c in the open 
interval (a, b), which is a subinterval of the larger interval referred to above. 
It follows from the hypothesis that f’(c) = 0. Since f(a) = k, we get f(b) = 
k + (6 — a)-0 = k, and the result is proved. 


The preceding theorem has an important corollary—that two functions 
with the same derivative over an interval differ by a constant. 


(5.4) THEOREM. /f f’ = g’, on an interval, then f and g differ by a constant 
function on the interval. 


Proof. Since f’ = g’, we have (f — g)’(x) = f’(x) — g’(x) = 0 for every x in the 
interval. By (5.3) there exists a number k such that k = (f — g)(x) = 
f(x) — g(x) for every x in the interval. This completes the proof. 


The significance of Theorem (5.4), which will be fully exploited in the 
study of integration in Chapter 4, is that it gives a way of describing the set 
of all functions which have a given function as derivative. Specifically, let f 
be a function whose domain contains an interval J. Suppose that in one way 
or another we can find a function F with the property that F’(x) = f(x), for 
every x in J. Then the set & of all functions whose derivatives equal f on I 
consists of all functions which on I differ from F by a constant. To prove this 
assertion, we first observe that, for every real number c, the function defined 
by F(x) + c has derivative equal to F’(x) + 0 = f(x), for each x in J. 
Hence every function F + c belongs to the set §. Conversely, if G is any 
function in the set s, then by definition G’ = f = F’ on J. It follows by 
(5.4) that there exists a real number c (a constant) such that G(x) — F(x) = c, 
for every xin J. Thus on / the function G differs from F by a constant, and so 
the assertion is proved. 


EXAMPLE 1. If fis the function defined by f(x) = x? + 2x, find the set of all 


functions with derivative equal tof. In this case the interval J is the set of all 
3 


eee : ‘ Rees: * 
real numbers, and it is easy to see that one function in the set is x + x, 
since 


3 
~ (5 = ~') =x 22 — 7. 
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Hence each function G in the set is defined by 


3 
6) - > tx + 


for some real number c. As c takes on all real number values, we get all 
members of the set. There are no other possibilities. 


PROBLEMS 


1. For each of the following functions find those values of x for which f(x) or 
f’(x) vanish and use them to verify Rolle’s Theorem. 


():. eae Di $d = EEO 
(b) f0) = 12x — x Cale ee : 
© fo) = Gee DG-—-2 © (= 0-2 


2. For each of the following functions and the specified values of a and 4, find a 
number c such that a < c < band f(b) = f(a) + (6 — a)f'(c). 


(a) f@) = «ee = 1, b= 4 
(b) f(x) =x,a=0,b=1 


() fx)=- 2 a=1,5=3 


(d) fe) = Sa = 1b = 2. 


3. Consider the function f(x) = 1 — |x| defined on the closed interval from —1 
to 1. Which hypotheses of Rolle’s Theorem does this function satisfy and 
which does it not satisfy? Does this function satisfy the conclusion of Rolle’s 
Theorem ? 

4. Consider the function f defined on the closed interval [4, 7] by 


f(x) = 0, x = 4, 
f®%) =7-x, ee aie eae & 


Show where this function fails to satisfy the conditions of Rolle’s Theorem, and 
that it does not satisfy the conclusion. 

5. For each of the following functions f, find the set of all functions with derivative 
equal to f. 


(a). pO 4x - (<c), fehe “ 


(b) f(x) = 4° + x? + 2 (d) f(x) = a 


6. Prove that, if on an automobile trip the average velocity was 45 miles per hour, 
then at some instant during the trip the speedometer registered precisely 45. 
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6. The Differential. If y = f(x), we have denoted the derivative of f by 
df dy 


7 Or a oS The value of the derivative at a number a is written f’(a), 
x 2 


d d . 
or a (a), or a (a). Up to this point, the expressions df, dy, and dx by 
x x 
themselves have had no meaning other than as parts of notations for the 
derivative. However, the cancellation suggested by the Chain Rule 


dy _ dy du 


dx du dx 


indicates that the derivative behaves like a ratio and suggests that it may be 
possible to sensibly regard it as such. In this section we shall define a mathe- 
matical object called the differential of a function, examples of which are 
df, dy, and dx. The ratio of df, or dy, to dx will be equal to the derivative. 


dS 
ee. 
a 


Figure 20 


If fis a function having a derivative at a, we define its differential at a, 
denoted by d,f, to be the linear function whose value for any number f¢ is 


(daf)(t) = f' (at. 


For example, if f(x) = x? — 2x, then the differential d, fis the function of f 
defined by f’(a)t = (2a — 2)t. In particular, 


sy GE Se ea ae 


The value of the differential for a typical function fis illustrated in Figure 20. 
By simply df we mean the rule (or function) that assigns the linear 
function d,f to each number x in the domain of f’. 


(6.1) If f and u are differentiable functions, then 


df(u) = f’(u) du. (1) 
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Proof. This formula is an abbreviation of the equation 
di f(u) = f’(u@)) dau. C4) 


The proof is an application of the Chain Rule. We first write down the two 
linear functions d,f(u) and d,u. By the definition of the differential they are 


(d,u)(t) = u'(x)t, 
(d.f(u) (a) = [LAY Od]. 
The Chain Rule says that [f(u)]'(x) = f’(u(x) u’(x). ~~ Hence 
(d.fW)O = [f'(u@) )u'Qd]t 
= f" (u(x) (d.u)(t). 


Thus (2) appears as an equality between linear functions, and the proof is 
complete. 


If wis the independent variable x, then (6.1) reduces to the formula 


df (x) = f’(x) dx. (3) 


EXAMPLE |. Evaluate the following differentials: 
a w+ 2), 
(b) dy/x? + 3, 
aioe — x). 
Using formula (3), we get immediately 
(a’) d(x’ + 2) = 2x dx 
(b!) dy/x? + 3 = x(x° + 3)" dx 
ye a = 0 — x) (4x = 1, 


It is worthwhile learning to use the stronger formula (1). In problem (b), 
let f be the function f(u) = \/u. If we setu = x? + 3, then du = 2x dx and 


dy\/x2 + 3 = df(u) = f'(u) du 
ss ig ” du 
sie 3) 2x dx. 
Let us also do problem (c) using (1), but without explicitly making the sub- 
stitution u = 2x” — x. We get 
d(2x? — x) = 7(2x7 — x)’ d2x* — x) 
= 7(2x” — x)’ (4x — 1) dx. 
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Formula (3) establishes the fact that the ratio of df to dx is equal to the 
derivative f’. We can see this in greater detail by going back to the definitions: 


(daf)(t) = f' (apt, 
(d xii) xi 
Since x is the identity function, its derivative is the constant function 1. Hence 


(d,x)(t) = t. The ratio of the two linear functions d,f and d,x is thus the 
constant function 
daf _ f(ayt 


dx t 


= f'@. 


Having proved this formula for every a in the domain of f’, we can write it 
simply as if af 
de (a) = f'(@) del ed 


dx 


If f and g are differentiable functions, then it is easy to show that 
di(f + g) = dif + dag. The proof involves only the definition of the 
differential plus the fact that the derivative of a sum is the sum of the deriv- 
atives. In detail: 


[dil f + 9M = (F4+ Y@OIO = '@+ r’@lo 
= fat + gat = Gif) + Gag) 
= [dof + d,g\(t). 
The result is simply the equation 
d(f+raj=c7 = 


An analogous argument using the Product Rule for differentiation shows that 
dal fg) = f(a)dag + g(a)daf, or, more simply, 


: d( fg) = fdg + gdf. 


For each one of the six differentiation rules proved in Section 7 of Chapter 1, 
there is an analogous rule in terms of differentials: Let wu and v be differ- 
entiable functions, and c a constant. Then 


d(u + v) = du+ do, 
acu) = cde; 

dc = Q, 

d(uv) = udv + vdu, 


=" ; : 
du’ = re aa where r is any rational number, 


- iho u dv 
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Note that we have replaced the analogue of (v) in the list in Section 7 
of Chapter | by the formula corresponding to the more powerful theorem 
(8:2) of Chapter 1. 


EXAMPLE 2. Find the differential d(x* + ./x? + 2x)’. Applying the above 
formulas successively, we get 


Came ae ae a) 
T(x? + V x2 + 2x)’ d(x® + Vx? + 2x) by (v) 
= 71(x° + Vx? + 2x)(dx? + dV x2 + 2x) by i) 
Bee Sey ox ax 4 1x” 4 2x)? do’ + 2x)] by (v) 
Wee ee 4x? + 2x) 13x ax + 100" + 2x) "Ox dk + 2 dx) 
by (i) and (11) 


a (= ig — om 


The derivative is therefore given by 


3 RESP we 
oe) ie 
dx Vx? + 2x 


The task of computing the differential of a complicated function of x 
amounts to successively working the differential operator d through the given 
expression from left to right. At each stage one uses the correct one of 
formulas (i) through (vi), or formula (1), until one finally reaches dx, and the 
process stops. The derivative can then be obtained by dividing the resulting 
equation by dx. Note that an equation of the form df(x) = --- will always 
contain the symbol d on the right side. Equations such as dx’ = 5x* are not 
only false; they are nonsense. (Correct version: dx’ = 5x* dx.) 


EXAMPLE 3. Consider the functions 


G@) y = (4x' + 3x° + 1), 


es 


—_—_———— 9 


(b) Eo 
(c) z= 3y’, 
Find the differential of each: 
(a’) dy = 2(4x* + 3x” + 1) d(4x’ + 3x° + ]) 
2(4x° + 3x? + 1)(12x" dx + 6x dx) 
12x(4x*° + 3x? + 1)Qx + 1) dx, 
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(b’) dy = eo) a(x" = Lee eae 


G2 + 1p 
= (x +1 )2x dx — (x ita . Ana 
aie (x2 + 1) a (x2 + 1)2 


(cc) w= C)G)y "dy = 3 ae 


If we consider the composition of the function y in (b) with the function 
z in (c), we get for the differential of the composition 


4x dx i 4x 
a ae Se = ©, 
dz = Sy (2 + 1p ) (5 i ) (2 + 1p ae. 


One traditional interpretation of the differential, which is especially 
useful in physics, is that of an “infinitesimal.” If y = f(x), we know that 
dy = f’'(x) dx. Now dx is the function that assigns to every real number a 
the linear function defined by (d,x)(t) = x’(a)t = t;i.e., it assigns the identity 
function. Hence we can interpret dx as simply another independent variable. 
Then dy is the variable whose value for a given x and dx is shown in 
Figure 21. (Compare this illustration with Figure 20.) The difference between 
the value of fat x and at x + dx is denoted by Ay in the figure. If dx is chosen 


Ay 


x+dx 


Figure 21 


very small, then the difference between dy and Ay is relatively negligible. 
Hence dy measures the resulting change in the value of y = f(x) correspond- 
ing to an infinitesimal change dx in the variable x. 


EXAMPLE 4. The height / of a square pyramid is found to be 100 feet, and the 
length x of one edge of its base is measured to be 160 feet. The volume V 
of the pyramid is given by the formula V = 4hAx?. What error in the computed 
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volume will result from an error of 4 inches in the measurement of x? If we 
consider / as fixed, and V as a function of x, then 


dV = thdx” = 2hx dx. 


Since 4 inches 1s small compared with 160 feet, we set dx = 4inches = 3 foot. 
The resulting change in volume is then approximately 


dV = 2(100)(160) = 32 1000 = 3555 feet’. 


The percentage error in volume is 


dV. Any Gx 5 dx 


Se eee 


ii. 


d. 5 : 
We compute sea as = 0.0021 =.0.21%, and so the. percentage-error. in 
b 


volume is only 0.42%. 


PROBLEMS 


1. Find the following differentials. 


2 
(a) dx’ +x+1)=--- (f) ‘(uy 
(b) dix + 2) = ++: (g) d(az*+bz+c)=":: 


(a, b, and c are constants) 


Pee + Gx — 1) — --- (hk) dVIi+VIitx =": 


x — 1 
(d) a(2pt)--- (i) dx =": 


(ce) du'=--- (j) du’ + 20 — 1) =°-°:. 


1 a 
2. If y = 7x* + 2x + 1 and w = —, compute the differential of the composition 
y 


of y with w. That is, compute dw in terms of x and dx. 


a. SE ace 16 2 and y= = and z = y? + 1, compute dz in terms of ¢ and df. 
x 


4. Using Leibnitz’s Rule (the Product Rule), prove that d.(fg) = f(a)dag + gladif, 
thereby establishing rule (iv) on page 118. 

5. Using Theorem (8.2) of Chapter 1, prove that df" = rf(a)"‘d.f, thereby 
establishing rule (v) on page 118. 

6. What is the approximate change in the volume of a sphere of radius 10 feet 
resulting from a change in the radius of 1 inch? 
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7. A metal cylinder is found by measurement to be 3 feet in diameter and 10 feet 
long. What will be the error in the computed volume of the cylinder resulting 
from an error of 


(a) 1 inch in the diameter? 
(b) 0.5 inch in the length? 
(c) both the errors in (a) and (b) combined? 


8. If f(x) = kx, in what sense is f its own differential ? 


7. L’H6pital’s Rule. This section contains a number of theorems which 
provide an important technique for finding the limit of the quotient of two 
functions. These theorems are usually referred to collectively as L’H6pital’s 
Rule. Two examples of problems which are readily solved by this technique 
are the computations of the limits: 


/x — V2 ; xe 1 
lim ————— and im —————_ - 
x32 Wy — WZ ee 


Properly speaking, this section is a continuation of Section 5, since all the 
results are corollaries of Rolle’s Theorem and of the Mean Value Theorem. 

The following proposition, known as the Generalized Mean Value 
Theorem, is the basic lemma used in proving the theorems which make up 
L’H6pital’s Rule. (A lemma is a theorem included as a reference in proving 
other theorems.) 


(7.1) GENERALIZED MEAN VALUE THEOREM. Assume that a < b, and 
let f and g be functions which are continuous on the closed interval [a, b] and 
differentiable on the open interval (a, b). If g’(x) ¥ O for every x in (a, b), then 
there exists a real number c in (a, 6) such that 


TEP ey > Fe). 


g(b) — g(a) g'(c) 
Note that g(b) — g(a) 4 0. For otherwise the Mean Value Theorem 
would imply that g’(x) = 0 for some x in (a, 6), which is contrary to hy- 
pothesis. 


Proof. Let / be the function defined by 


wily = fae 
gb) — gi 


for every x in [a,b]. The function / is continuous on [a, 4], differentiable on 
(a, b), and, in addition, A(a) = h(b) = 0. Hence, by Rolle’s Theorem, there 


HOO: (Oh = flake y fg (x) — g(a)), 


A 
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exists a real number c in (a, 5) such that h’(c) = 0. Since 


ae me a Z 
we obtain 
ee A), 
a err a 


and from this equation the conclusion of the theorem follows at once. 
We first present L’H6pital’s Rule as a theorem about one-sided limits. 


(7.2) L°HOPITAL’S RULE I. Let f and g be functions which are differentiable 
on a nonempty open interval (a, b) with g'(x) # 0 for every x in (a, b). If, in 
addition, 


a her f(x) = tim g(x)= 9, - 


rat r—a+ 
, Ee 
oa 
then 
G2)... 
zie g(x) 


Proof. We may assume that f(a) = g(a) = 0. (If this is not the case to begin with, 
we simply define, or redefine, the values of fand g to be zero ata.) Thus we 
ensure that fand g are continuous on [a, 5). Let x be an arbitrary nun.ver in 
(a, b). Then fand g are continuous on [a, x] (recall that differentiability ata 
point implies continuity) and are differentiable on (a, x). Moreover, the 
derivative g’ does not take on the value Zero in (a, x). Hence, by the Gener- 
alized Mean Value Theorem and the fact that f(a) = g(a) = 0, we obtain 

£Q) _ f@) = 0 _ fe), 


(vy) a(x) — 0 g(x) 
for some number y in (a, x). As x approaches a from the right, so also does 
y, and hence lim f(x) lim = = L. This completes the proof. 
265 g(x) yat B(y) 


x—+V/2 = “3 
EXAMPLE 1. Compute lim A Let f(x) = Vx — V2 and g(x) = 
xr 2+ x= 
\/x — 2. Obviously, lim f(x) = lim g(x) = 0, and, since 
xr—2+4 x 2+ 
/ f | l 
(Gs PRG 2. EN 


2x ws = 9 
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f and g are differentiable, and g’ does not take on the value zero on any open 
interval with left endpoint equal to 2. We obtain 


| 
ion Ee a fey ie 
xr— 2+ 4 (x) xr 2+ | 
2/x — 2 
. Vx—2 0 
=i = = 
rt  /x V2 
x— V2 
And it follows by L’H6pital’s Rule that lim eee = th 
xr— 24 /Xx is a3 


It is a simple matter to verify that Theorem (7.2) remains true if (a, b) 
is replaced throughout by (b, a), and lim is replaced throughout by lim . 


This fact, significant in itself, also implies the following two-sided form of 
L’HOpital’s Rule. 


(7.3) L’HOPITAL’S RULE Il. Consider an open interval containing the number 
a, and let f and g be functions differentiable and with g'(x) # 0 at every point 
of the interval except possibly at a. If 


Glin Fy = line etry = 0, 


Cee ae & © 5 ee 
Space ak 3 eben 


then 


; way SAX) _ 
ae g(x) - 


The hypotheses of (7.3) have been taken as weak as possible. If, as 


frequently happens, the functions f and g are also continuous at a, then (i) 
can be replaced by the simpler condition f(a) = g(a) = 0. 


1/2 1/2 


oe 


b] 
z—a xis — qils 


EXAMPLE 2. Evaluate lim where a> 0. If fQ) = 4° =a" 
l 


and if g(x) = x!/* — q'/*, then the derivatives are given by f’(x) = 55178 
x? 


| 
33218” and it is clear that f and g are differentiable (and hence 


continuous) and g’ is not zero on an open interval containing a. Moreover, 


and 2x) = 
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f(a) = g(a) = 0. Hence, by L’H6pital’s Rule, 


1 
pe ee Se 
xa baal sls xr—a l dail? 2 
3x2/8 
21 Bi a : 
EXAMPLE 3. Compute lim -; , Where a # 0. Doing this problem by 
za X —4 


L’H6pital’s Rule is somewhat akin to smashing a peanut with a sledgeham- 
mer. The fact that 


Se © eee, | 


| 
ae (x — a(x + a) gee ee 


immediately implies that 


As a 


2 Sige a eee 
ke a 
Of course, the same answer is obtained by L’HG6pital’s Rule. If we let 
ate ke oe x) = x" a, then f(a) = sta)-= 9 and f'(x) = 1 
Bnd-2- (x)= 2x. . Henee 


It is important to realize that L’H6pital’s Rule II can be applied only 
if the function oe is undefined at x = a and if lim f(x) = lim g(x) = 0. 
x w~— a Er AS 


For example, if f(x) = x? + 3x — 10 and’ g(x} = 3x, then 
yy: x +3x—10_ 0 


a 
but 
ee 
oe SS 


If the hypotheses of Theorem (7.3) are satisfied for the functions f’ 
and g’, that is, for the derivatives of fand g, respectively, then we can conclude 
that 

Bt eG) fst fo") 
a. = 


This fact suggests the possibility of applying L’H6pital’s Rule more than once, 
and in some problems it is necessary to take second or higher derivatives to 
find the limit. 
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1/3 


—x-—2 
EX E 4. Evaluat ger eee aaeerimate Let se Qat 2o eee 
AMPL valuate ree ear ate ~ et 7 (x) x x — 2 and 


g(x) = 3x? — 6x + 3. Then f(1) = g(1) = 0, and the derivatives are 
given by f’(x) = x 7/8 — 1 and g’(x) = 6x — 6. However, the value of 


es ee ee | 
- Pe ant 6x — 6 


is not obvious because f’(1) = g’(1) = 0. Taking derivatives again, we get 


f(x) = —2x75/8 and g(x) = 6, and it follows that 
i. ee ee 
xz 1 g(x) ppt 6 9 


Thus two applications of L’H6pital’s Rule yield 
fe)... ee 


1 / r ial 
aa a(x) Ce 4 (x) x—1 g' (x) 


\O| = 


A variation of (7.2), not difficult to prove, is the following: 


(7.4) L°HOPITAL’S RULE I. Let f and g be differentiable on an open interval 
(a, ©) with g’(x) ¥ Oforx >a. If 


(i) lim f(x) = lim g(x) = 0, 


An analogous theorem holds if (a, «) is replaced by (— #, a) and if 
lim is replaced throughout by lim . 


1 

Proof. The result is a corollary of (7.2) and the Chain Rule. Let t = —, and set 
1 1 : 

F(t) = § (*) = f(x) and G(t) = ‘(*) = g(x). Since ¢ approaches 0 


from the right if and only if x increases without bound, 


lim F(t) = lim s(?) = lim f(x) =, 


t>0+ t>0+ I— 00 


lim’ Gt) = lim g (*) ==; net) eee Qi 


t>0+ t—0+ I— 00 


By the Chain Rule, F’(t) = f’ (*) (- ‘) and G’(t) = g’ (;) (- x): 
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e ® I 1 
lim se = im PAD NT 2) () 5 j . 
0+ G'(t) 0+ (:) ( ) ge oe (:) 
UAE: mG 


The last limit exists and is equal to L since 


7 
ry pg rary 


lim =' lim 


Hence 


t—0+ ; (!) L—>00 g’(x) 3 
ae 
eae a Fo 
By L’HOpital’s Rule I it follows that lim G(r) = I. Hence 
t> 0+ f 


fim £2 tim FO 
niles) (aan 7 alata 


and the proof is complete. 


An important observation is that all the forms of L’H6pital’s Rule 
developed so far are valid whether L is finite or not. This fact requires no 
new proof and has really already been established. The reason is that the 
basic conclusion of Theorem (7.2) is the equation 


ray fey 


lim —<= lim += 
r—a+ g(x) z— a+ § (x) 


and this holds good whether or not lim = is finite or infinite. 
zra+ JX 


There is another significant variation of L’H6pital’s Rule, whose proof, 
although requiring only the Generalized Mean Value Theorem, cannot (as 
far as we know) be obtained from (7.2) by a simple substitution. It states 
that the several forms of condition (i), lim f(x) = lim g(x) = 0, can be 
replaced by lim |g(x)| = «. The specific statement which we prove is the 
following: 


(7.5) L’HOPITAL’S RULE IV. Let f and g be functions which are differentiable 
on a nonempty open interval (a, b) with g'(x) # O for every x in (a, b). If 


(Gi) lim |g(x)| = 0, 
xr—a+ 


rat g'(x) 7 


Gi “ey SOS 


=) 


128 APPLICATIONS OF THE DERIVATIVE  [cuap. 2 


then 


5 ORO. 


r—a+ g(x) 


Proof. Let e be an arbitrary positive number. By hypothesis (ii), there exists a real 
number c in (a, 5) such that 


Pees. 
g’(x) 


L nae for every x in (a, c). 


By hypothesis (i) there exists a real number d in (a, 6), which we shall for 
convenience assume to be in (a, c), such that, for every x in (a, d), the follow- 
ing three inequalities hold: 


f(c) 
g(x) 


(see Figure 22). It is a consequence of the last inequality that 


pee 
g(x) 


g(x) = 0; < € 


Figure 22 


<il+e for every x in (a, d). 


Now let x be an arbitrary real number in (a,d). By the Generalized 
Mean Value Theorem, there exists a real number y in (x, c) such that 


- JOS = FO 


g(x) — gc)’ 
Hence 


es ee 2 (g(x) = gle) + FC) 


Dividing by g(x), which cannot be zero, we get 


pica ae (: = )) foe 
20), 203 g(x) g(x) 


An equivalent equation is 


f(x) ee pe . ce) 5,2 oe 
— (S2 Z ) (: #0) L eat a 
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From the general properties of the absolute value [see specifically (1.3) and 
(1.4), page 7], it follows that 


f(x) | f') | | g(c) eey a | fe) 
——-~_—[) < — LL} j\1 — —=| + |L| |—— 
g(x) g’(y) g(x) g(x) g(x) 
Hence, the inequalities established in the first paragraph of the proof imply 
that 
0 Sis 84 Cade 
g(x) 
Since the right side of this inequality can be made arbitrarily small by taking e 
sufficiently small, it follows that lim f(x) = L, and the proof is complete. 
xra+ EX 


It is not difficult to derive variations of the preceding theorem analogous 
to the modified versions of (7.2) described above. Thus, with the obvious 
changes in the hypotheses, this last form of L’H6pital’s Rule also holds for 
two-sided limits and with a or L (or both) replaced by +o. 


3 l 
EXAMPLE 5. Compute lim = . Let f and g be the functions defined by 
x 
wo l 


f(x) = Vx + land g(x) = x + 4, respectively. Since f’(x) = 3@ + DF 


and g’(x) = 1, we see that f and g are differentiable on the interval (1, «) 
and that g’(x) # 0. Moreover, lim |g(x)| = lim |x + 4] = o, and 


l 
/ 
lim £%) ee les le ee 
LO & (x) xr 00 I 


It follows by L’H6pital’s Rule that 
vx +1 f(x) _ 


= jim = 
en FF 4 55 EX) 


0. 
The several forms of L’H6pital’s Rule which we have derived in this 
0 
section fall naturally into two types, symbolically denoted as the 0 type and 


the ~ type. Theorems (7.2), (7.3), and (7.4) are all examples of the first type, 
CO 


whereas the harder Theorem (7.5) is the prototype of the second type. The 
full power of the — forms will be realized later in the book in conjunction 
CO 


with the logarithmic, exponential, and trigonometric functions. 
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PROBLEMS 


1. Evaluate each of the following limits without using L’H6pital’s Rule. 


2 


@) ine or © tim 

) tim 28 i ore 

(©) lim = : (2) tn Gat +S 

(d) lim Sot ace (h) lim 
200 — +it-—1 


2. Evaluate each of the limits in Problem 1 using an appropriate form of 
L’HOpital’s Rule, if it is applicable. 
3. Evaluate each of the following limits. 


al a — — 


n a positive integer 


(0) tim St () tim att 

i haa AS Pt ee 
4. Compute 

© tim © tin ats 

© tm SP Se ae 


5. Compute each of the following limits directly using the * form of L’HOpital’s 
CO 
Rule. Verify the result by writing the quotient in a different form and using 


either the ; form of the rule or some other method. 


A 
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6. Suppose that F is a function differentiable on the open interval (0, ©) and such 


1 
that F’(x) = —, for every x > 0. Show that 
x 


oe 2 = 
(a) lim = a 0 (c) lim F(x) — 0, 
ee - ®—>00 bs 
F(x) for every positive integer 7. 


EIS 


CHAPTERS 


Conic Sections 


We shall now consider a certain type of curve called a conic section. 
Each of these curves is the curve of intersection of a plane with a right 
circular cone and each is also the curve defined by a second-degree equation. 
It is also true that any second-degree equation in x and y defines one of these 
curves or a degenerate form of one of them. We encounter all of them— 
the circle, the parabola, the ellipse, and the hyperbola—frequently in mathe- 
matics and also in the physical world. 


1. The Circle. We looked at a circle in Chapter 1 and have a definition from 
a first course in plane geometry. This is still the definition: A circle is the 
locus of points in a plane at a given distance from a fixed point. The given 
distance is called the radius and the fixed point is called the center. 

If the center of the circle is at (h, k), the distance from the center to a 
variable point (x, y) is, by the distance formula, \/(x — h)? + (y — k)?. 
If the radius is r, we have an equation of the circle given by 


VO = AP + = KP = 3. (1) 


An equivalent equation which is more commonly used is 


sO ora a (2) 


It is easy to see that, not only do all points at a distance r from (h, k) lie 
on the graph of (2), but also all points on the graph of (2) are at a distance r 
from (hf, k). 


EXAMPLE |. (a) Write an equation of the circle with center at the origin and 
radius 3. (b) Write an equation of the circle with center at (—1, 2) and radius 
en 


(a) By the distance formula, the first circle has equation 


V/(x — 0)? + (vy — 0) = 3, or VP+ys= 9, 
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(b) By the distance formula, an equation for the second circle is 
Vix — (-DP + ©& — 2)? = 5. Equivalent equations are 


Pet Saye 2)? 128. 
x? + y? + 2x — 4y = 20. 


As the following examples will show, any equation of the form 
ax’ + ay? + bx + cy +d=0 is, loosely speaking, an equation of a 
circle. The words “loosely speaking” are inserted to cover possible degenerate 
cases. For example, if a = 0 and 6 and ¢ are not both zero, the equation 
becomes an equation of a line. In another degenerate case “‘the circle’ may 


be just a point (if its radius is zero), and in another there may be no locus at 
all. 


ae 


EXAMPLE 2. Describe the graph of each of the following equations: 
(a) x° + y? — 6x + 8y — 75 = 0, 
(bt) x°+ y? + 12x — 2y + 37 = 0, 
(c) x + y— 4x —-5y+12=0, 
(d) 3x*+ 3y? — 9x + 10yv --H=0. 


The technique of completing the square is useful in problems of this 
type. In (a), we write equations equivalent to the given equation until we 
recognize the form. 


x —6x+ y?+ By = 75, 
x —6x+9+y4 8) + 16= 754+ 94 16, 
(x — 3)’ + (V+ 4)’ = 100. 


The graph is a circle with center at (3, —4) and radius 10. 
Applying the same technique to (b), we have 


x + 12x+ y? — 2y = —37, 
x*+ 12x+ 36+ y? —2y+1= —374+ 364+ 1, 
eae 7 = ly = 8. 


The graph is a circle with center at (—6, 1) and radius 0; i.e., it is just the 
point (—6, 1). We may say that the graph consists of the single point, al- 
though we sometimes describe it as a point-circle. 
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The equation of (c) gives different results: 

x — 4x + y? — Sy = —12, 
x—4x%+4+y? —S5y4+ 2 = -124+ 44 33, 
(x — 2+ (y — 3) = - 5. 

For any two real numbers x and y, the numbers (x — 2)? and (y — 3) 
must both be nonnegative, while —{Z is certainly negative. Hence there are 
no points in the plane satisfying this equation. However, the form of the last 
of the three equivalent equations is that of the equation of a circle, and we 
sometimes say that the graph is an imaginary circle with center at (2, 3) and 
radius 3\/ —7. 

Equation (d) requires a bit more manipulation: 


<] 
= 


3x? — 9x + 3y? + 10y 


| 
| 


12° 
x 3x t+ y+ Wey =H 

ea set ety ++ B= B+ Pt %, 
(- P+ 0+ =7. 


The graph is a circle with center at (2, —8) and radius \/7. 


By completing the square, as in the above examples, one can show that 
any equation of the form 


ax? + ay?>+ bx +cy+d=0 


is an equation of a circle of positive radius if and only if a #0 and 
b? + c? > 4ad. 

The circle is also the intersection of a right circular cone with a plane 
perpendicular to the axis of the cone. If the plane passes through the vertex 
of the cone, the intersection is a point. 

We can use the techniques of the calculus, as well as our knowledge of 
Euclidean geometry, to write an equation for a circle or for a line tangent toa 
circle, from given geometric conditions. 


EXAMPLE 3. Write an equation of the line which is tangent to the graph of 
(x + 3)? + (vy — 4) = 25 at (1,7). We may find the slope by use of the 
derivative, differentiating implicitly and remembering that one interpretation 
of the derivative is the slope of the tangent: 2(x + 3) + 2(y — 4)y’ = 0; 
x+3 1 +3 + 


oes The slope of the tangent is — 7 lige aaa We 


hence y’ = — 
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may also find the slope of the tangent by noting first that the radius to (1, 7) 
7-4 3 

ee — and then by remembering that the tangent is 

perpendicular to the radius and hence has slope —4. Thus the tangent line 

has equation y — 7 = —3(x — 1) or 4x + 3y = 25. 


has slope 


PROBLEMS 
1. Write an equation for each of the following. 
(a) A circle with center at the origin and radius 3. 
(b) A circle with center at the origin and radius 42. 
(c) A circle with center at (—2, 2) and radius 5. 
(d) A circle with center at (3, 0) and radius 3. ~ 
(e) Acircle with center at (0, —7) and radius 7. 
(f) A circle with center at (—3, —3) and radius 3/2. 


(g) A circle with center in the first quadrant, radius 4, and tangent to both 


axes. 
(h) A circle with center on the y-axis, radius 3, and tangent to the x-axis 


(there are two such circles). 


(i) A circle with radius 2 and tangent to the x-axis and to the line defined by 
the equation x = 5 (there are four such circles). 


2. For each of the following equations, describe the curve defined by it. 


(a) x? + y? = 64 ty Se | 
fb) x +o = 32 (g) x? + y? — 8x — 12y + 27 =0 
(ec) x” 4 ty = Ay =. 9 (h) 2x? + y2— 5x.- Ty +2 = 0 
(d) («+ 2)?+ y? = 16 (i) 9x? + Oy? — 12x + 30y = 71 


Pe Se ee ID a Oy = ee By = 3. 


3. Show that, if 62 + c? > 4ad and a # 0, the equation ax? + ay? ++ bx +cy + 
d = O defines a circle with center at 


b c | Fe + ¢ — dad 
— —, — — ) and radius . /———————_ 
2a 2a 4a? 


4. Show that the tangents from a point (x1, y;) outside the circle defined by 
(x — hy? + (Gy — k)* = r’ to the circle are of length 


SBA = RR OA = 2 F. 
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5. Show that the line tangent to the circle defined by 
ax’? + ay? + bx + cy +d = Oat (x, y:) has the equation 


b 
axnx tary + 5+) +50 +9) td =0. 


6. Write an equation of the line which is tangent to 
(a) the circle defined by x? + y? = 25 at (—3, 4). 
(b) the circle defined by x? + y? = 9 at (0, 3). 
(c) the circle defined by (x — 2)? + (y + 8)? = 169 at (7, 4). 
(d) the circle defined by x? + y? — 10y = 33 at (7, 2). 


7. Write an equation of the line containing the common chord of circles defined 
by x? + y? — 8x — 12y = 48 and x? + y? — 4x + 6y = 23. 


8. Given a circle and a line tangent to it, the segment of the line between a given 
point and the point of tangency is commonly called the tangent from the point 
to the circle. Show that the locus of points from which the tangents to two 
unequal externally tangent circles have equal length is the common internal 
tangent line. 


9. Write an equation for the circle which passes through 
(a) (3,4), (—4, 3), and (5, 0). 
(b) (7,1), (6, 2), and (—1, —5). 
(c) (4, 16), (—6, —8), and (11, 9). 


10. Use the results of Problems 3 and 5 to show that a line tangent to a circle is 
perpendicular to the radius drawn to the point of tangency. 


2. The Parabola. A second conic section, which appears in a great many 
physical applications, is the parabola. In this section we shall derive its 
equation and study its properties. 

By definition, a parabola is the locus of points in a plane equidistant 
from a given line and a given point not on the line. The line is called the 
directrix, and the point is called the focus. A simple equation for a parabola 
is found if a point on the x-axis is used for the focus and a line perpendicular 
to the x-axis and on the opposite side of the origin from the focus is used as 
the directrix. In Figure 1, the focus is at (a,0) and an equation of the 
directrix is x = —a. 

The distance from an arbitrary point (x, y) to the focus (a, 0) is, by the 
distance formula, \/(x — a)? + (vy — 0)?. The perpendicular from (x, y) 
to the directrix intersects that line at (—a, y), and so the distance from the 
point to the line is W(x + a)? + (G — y? = |x + a|. The point (x, y) 


SEC. 2] THE PARABOLA 137 


lies on the parabola if and only if the two distances are equal. Hence an 
equation of the parabola is 


V(x — a + y? = |x + al. 


An equivalent equation, which is simpler, is obtained by squaring both sides. 
We get 


x —2axtaty= x4 2ax4+ a, 


This equation has been derived in such a way that its graph contains all 


points and only those points equidistant from the focus and the directrix. 
y-axis 


Or 


parabola 


directrix 
i? = 4ax 


Figure | 


The point on a parabola nearest its directrix is called the vertex. For 
the parabola defined by y? = 4ax, the vertex is the origin, and there are no 
points on the curve to the left of the vertex. Since |y| increases indefinitely 
as x increases, the graph is not a closed curve but opens to the right. We 
cannot, of course, draw the entire curve, but the incomplete graph in Figure 1 
is sufficient to describe the entire curve for us. Note that the graph is sym- 
metric with respect to the x-axis. 

If we had chosen (—a, 0) for the focus and the line x = a for the direc- 
trix, we would have a parabola with equation y? = —4ax. This parabola 
would also have its vertex at the origin, also be symmetric with respect to the 
x-axis, but would open to the left. 
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With a horizontal directrix and the focus on the y-axis, the equation 
would be x? = 4ay or x? = —4ay, opening upward or downward, re- 
spectively. 

Thus the graphs of y? = kx and x? = ky are parabolas with their 
vertices at the origin. The line through the vertex and the focus is called the 
axis of the parabola. The graph of y? = kx has the x-axis for its axis and 
opens to the right or left, depending on the sign of k. The graph of x? = ky 
has the y-axis for its axis and opens upward or downward, depending on the 
sign of k. The absolute value of k determines the shape of the parabola. If 
k = O, the two equations reduce to y? = O and x? = O, whose graphs are the 
x-axis and the y-axis respectively. Thus a straight line may be regarded as a 
degenerate parabola. 


EXAMPLE |. Find the coordinates of the focus and an equation of the direc- 
trix of x? = —7y, and sketch its graph. An equivalent form of the equation 
is x° = —4(4)y, from which it follows that the focus is (0, —Z), that the 
directrix is y = 4, and that the graph opens downward. It is shown in 
Figure 2. 


y-axis 


directrix 


X-axis 


7 


focus ¢(9,—-4 


Figure 2 


EXAMPLE 2. Write an equation of the parabola with focus at (—3,0) and 
directrix x = 3. Sketch its graph. An equation may be found by use of the 
definition or by use of the formulas, either method giving y?> = —12x as the 
simplest equation. The graph is shown in Figure 3. 


In every case considered so far, the parabola has one of the coordinate 
axes for its axis and the focus and directrix are equally spaced on opposite 
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sides of the origin. The equations are somewhat more involved if other 
vertical or horizontal lines are chosen as axes, and even more involved if the 
parabolas have axes which are not parallel to one of the coordinate axes. 


directrix 


Figure 3 
EXAMPLE 3. Write an equation of the parabola with focus at (2,5) and 
directrix x = —6. Sketch the graph (see Figure 4). From the definition of 
y-axis 


(y —5)?= 16(x + 2) 


directrix 


Figure 4 
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the parabola, 


oe BEGG Dt SG 5, 
x + 12x + 36 = x — 4x +44 & = 5Y, 
léx + 32 = (y — 3. 
(y = 5Y = 16 + 2): 


Note the similarity of this form to the y? = 4ax form. The x-coordinate of 
the vertex, which is located halfway along the perpendicular from the focus 
to the directrix, is equal to —2. That is, it is found by setting x + 2 equal to 
zero. Hence the y-coordinate of the vertex is found by setting y — 5 
equal to zero. Thus the vertex is the point (—2, 5). 


Let us now write a general equation of an arbitrary parabola with a 
horizontal directrix (see Figure 5). Consider the perpendicular from the 
focus to the directrix, and denote the midpoint of this line segment by 


y-axis 


(x — h*) = 4a(y —k) 


focus ¢(h, k + a) 


(A, k) 


k directrix 
VHB Oe 


X-axis 


Figure 5 


(h, k). This is the vertex, the point on the parabola nearest the directrix. Let 
the focus be the point (A, k + a). Then the length of the segment is |2a| and 
the directrix is the line y = k — a. By algebra similar to that in Example 3, 
we go from 


to 
(x — h) = 4a(y — k). (1) 


This parabola will have the same shape and the same orientation (opening 
upward or downward) as x* = 4ay, but will have its vertex at (A, k). 
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Similarly, (vy — k)’ = 4a(x — h) will have the same shape and the same 
orientation as y> = 4ax, but will have its vertex at (A, k). 


EXAMPLE 4. Write an equation of the parabola with vertex at (—3, 2) and 
directrix y = 6. The midpoint of the segment connecting focus and directrix 
and perpendicular to the directrix is at (— 3, 4) and the segment is of length 4. 
With directrix above the focus, the parabola opens downward and has the 
equation 

cr 2) = ey = 4). 


EXAMPLE 5. Describe the graph of y? + 2x — 3y + 7 = 0. We first write 
y’ — 3y = —2x — 7 and, completing the square, add 2 to each side of the 
equation: 
ye sp PP 2, — 7 + 2, 
mes iia i adres 2 


The graph is a parabola with vertex at (— 42, 3), focus at (— 23, 3), and direc- 
trix x = —42. It opens to the left and has the same shape and orientation 
as the graph of py? = —2x. 


The graph of any function f defined by an equation f(x) = ax? + bx+ ¢, 
where a ¥ 0, is a parabola. The defining equation may be written 


or 
f(x) =a (: =f E) — mr Pen 


Hence the graph of the function, which is the graph of the equation y = f(x), 


4a 


yee 4 4ac — b° 
ose ast 


Comparing this equation with (1), we see that the graph is a parabola with a 
4 a biladqce oP 
vertical axis and its vertex at | — — , —————_} - 
2a 4a 


b\?  4ac — Bb 
is the graph of y = a (» + >. os ue Sars , Or, equivalently, of 
a 


PROBLEMS 


1. Write an equation for each of the following. 
(a) The parabola with focus at (—2, 0) and directrix x = 2. 
(b) The parabola with focus at (0, 3) and directrix y = —3. 
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10. 


ae 
sz 
=e 


CONIC SECTIONS  [cuHap. 3 


(c) The parabola with focus at (0, —1) and directrix y = 1. 

(d) The parabola with focus at (4, 0) and vertex at the origin. 

(e) The parabola with focus at (0, —2) and vertex at (0, 2). 

(f) The parabola with vertex at (—5, 0) and directrix x + 1 = 0. 


. Sketch the graph of each of the parabolas in Problem 1. 
. Sketch on the same set of axes the graph of each of the following equations. 


Compare and contrast the graphs, noting common features and differences. 
(a) y? = 5x (c) y? = 2x 
(b) yy>=x (d) y? = 3x. 


. Sketch on the same set of axes the graph of each of the following equations. 


Compare and contrast the graphs, noting common features and differences. 
(4) y= se = 2 
(b) y? = —2x (d) x? = —2y. 


. Consider a point (x1, yi) on the graph of y? = 4ax. 


(a) Find the slope of the tangent to the graph at (14, yj). 
(b) Write an equation of the tangent line in (a). 
(c) Show that yy; = 2a(x + x) is an equation of the tangent line. 


. Consider a point (xj, y;) on the graph of x? = 4ay. Show that xx, = 2a(y + yi) 


is an equation of the line tangent to the graph at (%, y1). 


. If (x1, y1) lies on the graph of y = ax? + bx +c, show that 3(y + yi) = 


axx, + $b(x + x1) + c is an equation of the line tangent to the graph at 
ere y)). 


. Write an equation of a line which passes through the point (8, 7) and is a tangent 


to the graph of y? = 6x. 


. (a) Find the point where the tangent to y? = 4ax at the point (x, yi) cuts the 


x-axis. Assume that a ~ 0. 


(b) Show that the segment of the tangent line between (xj, y;) and the point 
found in (a) is bisected by the y-axis. 


Write an equation for each of the following: 


(a) The parabola with vertex (1, 1) and directrix x = —1. 
(b) The parabola with vertex (1, 1) and directrix y = 0. 
(c) The parabola with vertex (4, 3) and directrix x = —2. 


(d) The parabola with vertex (—1, 2) and directrix y = 4. 


Find the focus of each of the parabolas in Problem 10. 

Do Problem 10 with each occurrence of the word ‘‘vertex”’ replaced by “focus.” 
Find the focus, vertex, and directrix of the parabola which is the graph of each 
of the following equations or functions. Sketch the graph. 


(a) y= x? (a) 9" + 2y + 2x = 0 
(b) y= x — 2 (ec) gy) = -—y-—4y +4 
cj fai = = pea (f} x? x + yp =O. 
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3. The Ellipse. A third conic section, quite fashionable these days as 
astronauts circle the earth in elliptical orbits, is the ellipse. In this section we 
shall derive its equation and study its properties. 

By definition, an ellipse is the locus of points in a plane the sum of whose 
distances from two given points is a constant. The constant, of course, must 
be greater than the distance between the two given points. The two points 
are called the foci of the ellipse. A simple equation for an ellipse is found if 
the points (—c, 0) and (c, 0) are used for foci and 2a for the constant sum of 
distances. The foci and the arbitrary point (x, y) are shown in Figure 6. The 


Figure 6 


distance from (x, y) to (—c, 0) is \/(x + c)? + (vy — 0)?, and that from 
(x, y) to (c, 0) is /(x — c)? + (y — 0)2. The point (x, y) lies on the ellipse 
if and only if the sum of these two distances is equal to 2a. Hence an equation 
of the ellipse is 


VETER + VG OFF = 2a. 


Simpler equivalent equations are found by subtracting ./(x — c)?4+ y’ 
from both sides of the equation 


4av/(x — ce)? + y? = 4a” — 4ex, 
dividing by 4 and squaring again 
a(x? — 2ex + C+ y*) = a’ — 2a@cx + cx’, 
and simplifying again, 


x(a — e+ ay = aa — c?). 
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Finally, dividing by a?(a® — c’) we have 


Since a > c, it follows that a? > c’, and we replace a” — c’ by b. We then 
have the canonical form of an equation for the ellipse, 


This equation is derived in such a way that its graph contains all points 
which satisfy the locus condition. One of the problems at the end of the 
section will be to show that the graph contains only those points. 


(0, —6) 


Figure 7 


By placing y equal to 0, we obtain x = +a. The graph therefore cuts 
the x-axis at (—a, 0) and at (a,0), and the numbers a and —a are the x- 
intercepts of the graph. The line segment between (—a, 0) and (a, 0) is called 
the major axis of the ellipse. Similarly, the graph cuts the y-axis at (0, —b) and 
at (0, b), and the numbers 5 and —6 are the y-intercepts of the graph. The 
line segment between (0, —b) and (0, 5) is called the minor axis of the ellipse. 
It is not difficult to see that there are no points on the graph for which 
|x| > aor for |y| > 6. Symmetry across both axes and across the origin can 
be seen by noting that (— p, g), (p, —q), and (—p, —q) all lie on the graph if 
(p,q) does. The complete curve is shown in Figure 7. 
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There is a simple method of constructing an ellipse by placing thumb- 
tacks at the foci and looping around them a string of length 2a + 2c. Ifa 
pencil is placed in the loop so as to hold it taut and moved in a complete 
turn around the foci, the curve traced is an ellipse. This is readily seen from 
the definition. 


EXAMPLE |. Describe the graph of 4x? + 25y? = 100 and draw it. An 
2 2 

equivalent form of the equation is 55 + 2 = |, from which it is apparent 
that the graph is an ellipse with x-intercepts of —5 and 5, major axis of length 
10, y-intercepts of —2 and 2, and minor axis of length 4. The intersection of 
these axes, in this case the origin, is called the center of the ellipse. Using 
a = 25, a? — c? = 4, we have c? = 21. Thus the foci are at (— 1/21, 0) and 
(\/21, 0). The graph is drawn in Figure 8. : 


Figure 8 


If the foci are located on the y-axis at (0, c) and (0, —c), an analogous 
derivation gives the equation 


2 

x = 
a a 
= fe 


and its equivalent form 


In this case, the major axis is the line segment between (0, —a) and (0, a), 
and the minor axis that between (—b, 0) and (5b, 0). 
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EXAMPLE 2. Describe and draw the graph of 4x” + y? = 36. An equivalent 
2 2 

form of the equation is = + = = 1, from which we see that the positive 

y-intercept is larger than the positive x-intercept and therefore that the foci 

are on the y-axis. The x-intercepts are 3 and —3, the y-intercepts are 6 

and —6, and the foci are at (0, ./36 — 9) and (0, —3\/3). The graph is 


drawn in Figure 9. 


y-axis 


Figure 9 


Examples 1 and 2 were each for an ellipse with its center at the origin. 
In a manner similar to that used in the last section, we can write an equation 
for an ellipse with center at (h, k) and foci at (h — c, k) and (h + c,k). The 
equation is then 
bei}. 
a? = b2 ae 
If the center is at (h, k) and the foci at (h, k — c) and (A, k + c), an 
equation 1s 
y- kh, @= A, 


a? b2 


ot? Ve 


e. 25 
The graph is an ellipse with center at (—4, 7). The foci are above and below 


EXAMPLE 3. Describe and sketch the graph of 
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the center, the distance being \/25 — 9 = 4. Thus the foci are at (—4, 3) 
and (—4, 11). The ends of the major axis are (—4, 7 — 5) or (—4, 2) and 
(—4,7 + 5) or (—4, 12). The ends of the minor axis are (—4 — 3,7) or 
(—7, 7) and (—4 + 3,7) or (—1, 7). The graph is drawn in Figure 10. 


(x+4)? G-7)_, 


Figure 10 


There is an alternative definition of the ellipse, one analogous to the 
definition of a parabola. It may be seen in considering the distance from a 
point (x, y) to the focus (c, 0). This distance is \/(x — c)? + y?. If the point 


x? 2 x20? é 
lesan the ellipse, then —.+ — sf ee ie Se a 
a os ¢ a? 
Thus the distance from the point to the focus is 
xc 
xt — rex +e + (e =e sg ‘ee. 
or 
xo 
a Dex a’, 
q?2 
or 
me 
——a 
a 
C 2 a 
This latter expression may be written —|x — —). But |x — —| is the 
a C é 
a 
distance from the point (x, y) to the line x = —. Thus the distance from the 


c 
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; ae Se ; : ‘ 
point to the focus (c, 0) is — times the distance from the point to the line 
a 


2 
x= Ee . This result could also be stated by noting that the ratio of the two 
e 


distances (that to the focus and that to the line) is a constant for all points 
on the ellipse. The line is called a directrix for the ellipse and the constant 


ratio, — , 1s called the eccentricity of the ellipse. The eccentricity is less than 1. 
a ; 


For all points on a parabola, the ratio of the two distances (that to the focus 
and that to the directrix) is also a constant, namely 1. A parabola is said to 


have eccentricity 1. Even as the ellipse has two foci, it also has two direc- 
2 


; ‘ : a ; : 
trices. The other is the line x = — —, and the ratio of the distance between 
Cc 


a point on the ellipse and the focus (—c, 0) to the distance from the same 
2 


C= C 
point to the directrix x = — — is the same constant -— - 
C a 


Not all equations for ellipses are in canonical form, but equivalent 
equations in canonical form can be found by factoring and completing the 
square. 


EXAMPLE 4. Describe and draw the graph of 16x? + 25y’ — 64x + 150y — 
111 = 0. Equations equivalent to this one are 
16(x’ — 4x) + 25(y + 6y) = 111, 
16(x* — 4x + 4) + 250" + 6y + 9) = 111 + 16-44 25-9, 
16(x — 2)” + 25(y + 3)” = 400, 
. (x— 2), @+3) 


Se i 


Zo 16 


From the last equation we can see that the graph is an ellipse with center 
at (2, —3), horizontal major axis of length 10, vertical minor axis of length 8, 
and c = 1/25 — 16. Thus the foci are at (—1, —3) and (5, —3). The graph 
is shown in Figure 11. 


As Example 4 illustrates, almost every equation of the type ax* + by? + 
cx + dy + e = Owithab > Ohasan ellipse for its graph. The reason for the 
“almost” can be seen as we write equivalent equations 


a(x +£x) +6(s° + 29) = —€, 
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oe C a d° sees 
a(x'+fx+ 2) +6('+ 44 4) =o ieee” 


C 2 d 2 Cc d’ 
a(x + £) +6(y+¥) = tigi lpg. 
If the expression on the right side of the last equation has the same sign as 
a (and hence 5), then the graph is an ellipse. If the expression on the right 


‘ d 
side is equal to zero, the graph is the single point (- a 5) = et ine 
expression on the right side has sign opposite to that of a, there is no graph, 
although the equation is said to have an imaginary ellipse for its graph. If 


a = b, the foci coincide and the graph is a circle. 


y-axis 


— 2)? + 3)? 
1) — : =. a 


Figure 11 


PROBLEMS 


1. Describe and sketch the graph of each of the following equations. Label the 
foci and the endpoints of the major and minor axes. 


2 My 2 2 
- icine oer 
oe a at oa a 
x y 2 y 
6) +4 =1 @) S+h=1 
x y 
eo ar 
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Write an equation for the ellipse satisfying the given conditions. 
(a) Foci at (—5, 0) and (5,0). Minor axis of length 24. 


(b) Center at the origin. Major axis horizontal and of length 14, minor axis 
vertical and of length 8. 


(c) Center at the origin. Minor axis vertical and of length 4. Passing through 
is, 8): 

(d) Foci at (—4,0) and (4,0). Endpoints of major axis at (—5,0) and 
(5, 0). 

(e) The locus of points the sum of whose distances from (0, 2) and (0, —2) 
is 7. 


. It has been shown that the distance between a point on the ellipse 


2 


=1 


9 


ae a? — ¢c? 


and the focus (c, 0) is a 
a 


ee ; c 
(a) Show that this distance is a — ~ for |x| < a. 
a 


(b) Show that the distance between a point on the ellipse and the focus 


(—c, 0) is * +a 
a 


xe 
and that the distance is a + —- 
a 


(c) Show that the sum of the distances from a point on the ellipse to the foci 
a 
a” — 


2 
is 2a and hence that the graph of ~ — ‘ea 1 contains only those 
a 


points which satisfy the locus definition. 


. Describe and sketch the graph of each of the following equations. 


= 27 2 — 4 


~ 2 = 
(a) — i 9 == | (d) 25x + 9—Y + 3) 25 
2 2 
(b) et) OF a1 (e) 9x? + 4y’ + 36x — 24y + 36 =0. 
poy, Ds ce Oy 
169 144 


. The line segment which passes through a focus, is perpendicular to the major 


axis, and has its endpoints on the ellipse is called a latus rectum. 
(a) Find the length of a latus rectum of the ellipse 4x” + 9y? = 36. 


(b) Find the length of a latus rectum of the ellipse b’x? + a®y? = a®b’. 
(Assume that b < a.) 


(c) Show that both latera recta of an ellipse are the same length. 


SEC. 4] THE HYPERBOLA 151 


6. 


10. 


11. 


iz. 


tr. 


Write equations of the directrices and find the eccentricity of each of the 
following ellipses. 


(a) 4x? + 9y? = 36 (b) 9x? + 4y? = 144. 


. Assume that 0 < c < a. 


(a) Find the distance between (x, y) and (—c, 0). 


ve 
(b) Find the distance between (x, y) and the line x = — 4 
Cc 


(c) Find the locus of points (x, y) such that the ratio between the distance in 


C 
(a) and the distance in (b) is a constant -— - 
a 


. Show that an ellipse becomes more nearly circular as its foci get closer and 


closer together. 


. Consider a point (x1, y1) on the graph of 62x? + a’y? = a?b’. 


(a) Find the slope of the tangent to the graph at (x1, y). 
(b) Write an equation of the tangent line in (a). 
(c) Show that b?xx, + a’yy,; = a’b? is an equation of the tangent line. 


Assume that the constants a, 5, c, d, and e are such that ax? + by? + cx + 
dy + e = Oisan equation of an ellipse. Consider a point (x1, y1) on this ellipse. 


(a) Find the slope of the tangent to the graph at (14, y;). 
(b) Write an equation of the tangent line in (a). 


(c) Show that axx,; + byy. + $c(x + x1) + 3d + y) +e =0 is an 
equation of the tangent line. 


Write an equation of the ellipse with horizontal and vertical axes satisfying the 
given data. 


(a) Foci at (—3, 2) and (5, 2). Eccentricity is 2. 
(b) Center at (2, —1). One focus at (2, 2). Point (2, 4) lies on ellipse. 


(c) Ends of major axis at (2, 4) and (12, 4). Ends of minor axis at (7, 2) and 
(7, 6). 


In the definition of the ellipse on page 143, we asserted that the constant must 
be greater than the distance between the foci. What is the locus of points in the 
plane the sum of whose distances from (—c, 0) and (c, 0) is the constant 2c? 
What must be the relation between a and 5 for the major axis of the ellipse 
= +- y = 1 to be horizontal? Vertical? 
Yo 


4. The Hyperbola. The fourth and last conic section is the hyperbola. By 
definition a hyperbola is the locus of points in a plane the absolute value of 
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the difference of whose distances from two given points is a positive constant. 
The constant must be less than the distance between the two points, since the 
length of one side of a triangle must be greater than the absolute value of the 
difference between the lengths of the other two sides. The two given points 
are called the foci of the hyperbola. 

If we select (—c, 0) and (c, 0) as foci and 2a as the difference of distances, 
the point (x, y) will lie on the hyperbola if and only if 


MO = oP Ee = VO er 2a. 
This equation is an abbreviation for the two equations 
V@— oP Fy - Ve Forty = 2a, 


VO fey +? — 4/7 — OF = Ze. 


We shall simplify the first of these two equations and leave as Problem | at 
the end of the section the proof that a similar simplification of the second 
results in the same equation. 

The steps are similar to those for the simplification of the defining equa- 
tion of an ellipse: 


V(x = ch + yy? = 2a+- VO + oP + y?, 
x — ext e+ y’ = 4a’ + 4a/(x +c)? + yy? + x’ + 2ex toe + y, 
—4cx — 4a’ = 4ar/(x + ce)? + y?, 
e’x” + 2a’cx + a’ = a(x’ + 2cx +c’ + y’), 


a’(c” — wy, 


(c’ = a’)x” ie ay 


and finally 


This last equation is closely akin to the equation of an ellipse; however, 
Z 2 


instead of 
Ce — A Pe 
case that 2a < 2c and so a < c, and therefore Cc — a >0. Thus ts 


c? — a’ which we replace by b’ to obtain the canonical equation for the hyper- 
bola 


the second term is . For the hyperbola, it is the 


We know, from the derivation, that the graph of this equation contains all 
points (x, y) such that the distance between (c, 0) and (x, y) is 2a more than 
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the distance between (—c, 0) and (x, y). In Problem 1 you will be asked to 
show that the graph also contains all points (x, y) such that the distance 
between (—c, 0) and (x, y) is 2a more than the distance between (c, 0) and 
(x, y). In Problem 3 you will be asked to show that the graph contains only 
those points. 

By setting y equal to 0, we see that the graph cuts the x-axis at (—a, 0) 
and (a, 0). These points are called the vertices of the hyperbola and the 
line segment joining them the transverse axis. By writing the equivalent 


9 


equation y” = — (x — a’), we see that there are no points on the graph for 
2 


|x| <a. Hence the graph cannot cut the y-axis. The curve is infinite in 
extent, since there are points on the graph for all x such that |x| > a, and |y| 
increases indefinitely as |x| increases. The curve is symmetric with respect to 
both axes and to the origin, since (—p, gq), (p, —q), and (—p, —q) all lie on 
the graph whenever (p, g) does. Although the graph is infinite in extent, the 
central portion of it is sketched in Figure 12. When we ask for the sketch of a 
hyperbola, it is the central part which is to be drawn. 


y-axis 

ee ee 
a2 hb? 

vertices 

(7,0) (c, 0) 
—" eo 7 
foci a eae a 
Figure 12 


b 
The graph of y = —+/x? — a? is the upper half of the right branch of 
a 
the hyperbola for x > a and the upper half of the left branch for x < —a. 


ot ee b | See eee 
In each case, since |x| > ./x? — a?, it is true that —|x| > {VY — @. 
a 


b ee ee 
However, the difference between the two functions - |x| and ve — @ 
a 
gets less and less as |x| increases. Thus the upper half of the hyperbola lies 


b : a 
below the graph of y = — |x| but gets closer to it as |x| increases. Similarly, 
a 


ee eee ee 
the graph of y = — —+/x? — a? is the lower half of the hyperbola and it lies 
a 
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b 

above the graph of y = — , |x|, but gets closer to it as |x| increases. The 

. b b 
union of the two graphs, those of y = — |x| and y = — — |x|, is also the 

a a 

' b b 
union of the graphs of the straight lines y = — x and y = —-x. These 

a a 


lines, approached by the hyperbola, are called the asymptotes of the hyperbola. 
They are of use in sketching the graph of a hyperbola, giving guidelines ap- 
proached by the hyperbola. They are easily drawn as the diagonals of the 
rectangle with vertical sides passing through the vertices of the hyperbola and 
horizontal sides passing through (0, —b) and (0, 5). 


x? a 


i 
a = 4,b = 3,andc = 5. The transverse axis is of length 8, the x-intercepts 
are —4 and 4, the foci are at (—5, 0) and (5, 0), and the asymptotes are the 
lines y = #xandy = —#x. A very useful device for remembering equations 
of the asymptotes is obtained by replacing the “‘l’’ in the equation of the 
hyperbola by “0”: 


EXAMPLE |. Describe and sketch the graph of = 1. Here we have 


x? ¥ 
ie oe 
which is equivalent to 
cana; Cogeeeera ee 
(3 = *) (; ,) 
y-axis 
y=-4x | a 3 
en kt a ae 
| | a 
| | 
(—4, 0) (4, 0) 
i350) (5,0) x-axis 
| | 
| | 
ee = 


Figure 13 
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The graph of this equation is the union of the graphs of ; + ; = 0 and 


: — : = 0, equations of the asymptotes. The graph is sketched in Figure 13. 
If the foci of the hyperbola are on the y-axis, at (0, —c) and (0, c) the 
2 ze 
equation will bes — : = |. The graph will have branches opening upward 
and downward instead of to the right and left. The lines y = ai and 
y=- 0 will be asymptotes. 


EXAMPLE 2. Describe and sketch the graph of 4y” — 5x? = 20. An equivalent 
2 ys 
equation A er a 1, from which we see that the vertices are at (0, —\/5) 


and (0, \/5). The transverse axis is vertical and of length 2\/5. The foci are 
at (0, —3) and (0, 3) and the lines y = —4\/5x and y = 4\/5x are the 
asymptotes. The branches open upward and downward, as can be seen in 
Figure 14. 


y=t4y5 x 


Figure 14 
x? y y’ x? 
The equations — — ac 1 and eget | have the same asymptotes, 
a a 


b b 
the lines y = — x and y = — — x, and they are called conjugate hyperbolas. 
a a 
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The first mentioned has its foci at (—c, 0) and (c, 0) and the latter has its foci 
at (0, —c) and (0, c), where c = \/a? + b?. The transverse axis of each is 
called the conjugate axis of the other. 

Examples 1 and 2 have each been a hyperbola with its center at the origin. 
The foci, however, may be anywhere in the plane and the midpoint of the 
segment joining them will be the center. If they are on a line parallel to one 
of the axes, an equation analogous to that of the ellipse can be written. In 
particular, if the foci are at (A — c,k) and (A + c,k), an equation of the 
Greed iae ie Say 


hyperbola is = 1, and an equation for the asymptotes 


a’ b? 
— hy — ky 
se” = ons = 0. The latter equation is equivalent to the separate 
b b : 
equations, y — k = ae — h)andy — k = — -(x — A). If the foci are at 
a 


(h,k — c) and (h, k + c), an equation for the hyperbola is 
Quart «7 Grade eg 


a2 b2 
y-axis y-1=4(x+2) 
| 
| 
| 
| 
| Grey Va] 
1 9 16 
| 
(<h1) W5, eh) si py 64) 
fe 


= 


\ X-axis 


| 
| 
| 
| 
| 
| 


Figure 15 


(+2 C= eae 

9 6 
The center of the hyperbola is at (—2, 1), the foci are at (—7, 1) and (3, 1), 
and the transverse axis is horizontal and of length 6. The asymptotes have 
equations y — 1 = 4(x + 2) and y — 1 = —$(x + 2). The graph is 
sketched in Figure 15. 


EXAMPLE 3. Describe and sketch the graph of 
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There is a focus-directrix definition of the hyperbola, analogous to those for 

the parabola and the ellipse. The distance from a point (x, y) to the focus 

(c, 0) is V(x — c)? + y?. The point lies on the hyperbola if and only if 

x? 2 fe 

ee =loyv=@-cC-xX+ = x’. Thus the distance from a 
a 


g? oO — @ 
point on the hyperbola to the focus is 
e Cc” 
vote ter+ (ac - 8+ Sx) = a 2 
V(Gs= 4) 
= —-x-—a 
a 
= |-x—a 


: ; ate C a ae 
As with the ellipse, this distance is equal to— |x — —| , or— times the distance 
a e a 
2 


: a ee : - ae 
to the line x = —. This line is again called the directrix, and the ratio — the 
C a 


eccentricity. However, for the hyperbola the eccentricity is greater than 1. 

The hyperbola can therefore also be defined as the locus of points the ratio 

of whose distances to the focus and to the directrix is a constant greater 
re a’ 

than 1. Corresponding to the focus (—c, 0) is a second directrix x = — —- 
C 

Not all hyperbolas with horizontal or vertical axes appear with equations 

in canonical form. But canonical equations can be found for them by factor- 


ing and completing the square. 


EXAMPLE 4. Describe and sketch the graph of 16x* — 9y? — 32x — 54y — 
641 = 0. Equations equivalent to the given equations are 


16(x° — 2x) — 9(° + 6y) = 641, 

16(x° — 2x + 1) — 9 + 6y + 9) = 641 4+ 16-1 — 9-9, 
16(x — 1)’ — 9(y + 3)? = 576, 

ety + 3) 


ee ee 


36 64 


The hyperbola, opening to the right and left, has its center at (1, —3), its 
vertices at (—5, —3) and (7, —3), its foci at (—9, —3) and (11, —3), and the 
lines y+ 3 = $(x — 1) and y+ 3 = —4(x — 1) for asymptotes. The 
graph is sketched in Figure 16. 
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y-axis 
| @—-1¥) on a 
7 36 2 ls 
< X-axis 
(—5,-3) S (1,-3) (7, -3) 
— —@ eee ee — ee eC CU COC  ™®@O ECU @ eee 
es | (Hea) 
| 
| asymptotes: 
| yt3=4400-1) 
| 
Figure 16 


The method of Example 4 can be used to show that every equation of the 
type ax? — by? + cx + dy + e = 0 with ab > 0 has for its graph either a 
hyperbola or a pair of intersecting straight lines. Equivalent to the given 


equation are 
2 Cc 2 d 2rieg 
a(x +x) —6(°-%) e, 
2 2 Z 2 
alee Cc po ee d Cc .. 
a(a*+£x4 6) — 6 (0 -$y + ds) = fa 4b 


C fs a" Cc d’ 


If the expression on the right side of the equation has the same sign as a and 


| 
| 
me 


d 
b, the graph is a hyperbola centered at (- = ; 5) opening to the right and 
to the left. If the expression has sign opposite to a and b, the graph is a 
d 
hyperbola centered at (- = <) but opening upward and downward. If 
a 


the expression is zero, the graph consists of the two straight lines 


d a Cc d a Cc 
y- f= (e+ 8) - y- f= —J8(x+ £). 


One particular type of hyperbola with its axes neither horizontal nor 
vertical has a simple equation and appears frequently in mathematics. It is 
the hyperbola with foci at (—a, —a) and (a, a) and the difference of distances 
equal to 2a. By definition, a point lies on this hyperbola if and only if 


V@— oF +O — a — V+ a + OF ay = 20. 
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Squarings and simplifications yield the equation 


4 2 
AY = on . 


This hyperbola has the coordinate axes for its asymptotes and is drawn in 


Figure 17. Its vertices are at (- ave, — 5 v2) and (; 2 v2): 


y-axis 
(a, a) xy= ae 
X-axis 
f—g, —a) 
Figure 17 
With foci at (—a, a) and (a, —a), the equation is xy = —4a? and the two 
branches lie in the second and fourth quadrants. This type of hyperbola is 
x? 2 2 x? 
called equilateral, as are the hyperbolas — — . = | and aa me 1 
a a 
witha’: = »b*, 
PROBLEMS 
1. Simplify /( + c)? + y2 — V(x — c)? + y? = 2a and show that it results 


2 2 
tee 
in the equation — — = J 
ae C2 ie a 
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2. Describe and sketch the graph of each of the following equations. Label the 
foci and vertices and sketch the asymptotes. 


2 2 
x | ae = 

(a) 9 = 1 ehsuy 9x 220 
y x’ 

(b) a7 7! (f) Mas 
a y 

(c) 25 144 = ] (g) xy = —6. 


2 
Xx y ae: 
(d) (ee ae : 


3. It has been shown that the distance between a point on the hyperbola 
x2 : 


. Call this distance qd. 


- = 1 and the focus (c, 0) is E jee 
a 


ae Cc se ge 
(a) Show that the distance d) between a point on the hyperbola and the focus 
(—c, 0) is cs + 7 
a 
(b) Show that x > a for a point on the right branch of the hyperbola and that 


Cc 
for such a point d; = “x —aandd, = -x+ a. 
a a 


(c) Show that x < —a fora point on the left branch of the hyperbola and that 


Cc Cc 
for such a point dj = — —x + aandd = — -x — a. 
a a 
x a 
(d) Hence show that the graph of oe 3 “s = 1 contains only those 
a co—a 


points which satisfy the locus definition of hyperbola. 


4. Write an equation for the hyperbola with horizontal and vertical axes satisfying 
the given conditions. 


(a) Foci at (—5,0) and (5,0). Transverse axis of length 6. 


(b) Center at the origin. Transverse axis horizontal and of length 4, conjugate 
axis vertical and of length 12. 


(c) Center at the origin. Transverse axis vertical and of length 6. Passing 
through (1, 2/3). 


(d) Center at the origin. Passing through (1, 5) and (2, 7). 


5. (a) Show that the graph of (x — 3)(y + 2) = 10 is an equilateral hyperbola 
with center at (3, —2) and asymptotes x = 3 and y = —2. 


(b) Sketch the graph described in (a). 
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6. Describe and sketch the graph of each of the following equations. Label the 


10. 


11. 


foci and vertices and sketch the asymptotes. 


Ge Ty i 3 


(a) ae 9 =| (e) 2xy — 4x — 4y —17=0 

2 2 
(b) oe _& Vay Qe x69 3 Wick vy SS 
(c) (x — 1) 4+ 3) = -12 (ior = Ae ae = i 8 = 


(d) 144(x + 4)2 — 25(y + 2)? = 3600 


. Hyperbolas with the same pair of foci are said to be confocal. Show that the 


following equations have confocal hyperbolas for their graphs and sketch them 
all on the same set of axes. 


2 ve 2 = 
2 S eee x a 2 a 
a (o) ete o 
x y ” ; 


. The line segment which passes through a focus, is perpendicular to the transverse 


axis extended, and has its endpoints on the hyperbola is called a latus rectum. 
(a) Find the length of a latus rectum of the hyperbola 9x? — 16y? = 144. 
(b) Find the length of a latus rectum of the hyperbola 62x? — a?y? = a?h?, 
(c) Show that both latera recta of a hyperbola are the same length. 


. Write equations of the directrices and find the eccentricity of each of the 


following hyperbolas. 
(a) 9x? — 16y? = 144 (b) 144y? — 25x? = 3600. 


Assume that 0 <a <. 
(a) Find the distance between (x, y) and (—c, 0). 
2 
(b) Find the distance between (x, y) and the line x = — = 
Cc 
(c) Find the locus of points (x, y) such that the ratio between the distance in 


c 
(a) and the distance in (b) is a constant -— - 
a 


Consider points on the upper right branch of the hyperbola x2 — y? = 16 and 
on its asymptote x = y. Find the y-coordinates of points on each for x = 10, 
100, 1000, 10,000, and show that the vertical distance between corresponding 
points is decreasing as x increases. 
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12. 


13. 


14. 


15. 


16. 
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Consider a point (x1, v1) on the graph of b?x? — a’y? = a*b’. 

(a) Find the slope of the tangent to the graph at (11, 1). 

(b) Write an equation of the tangent line in (a). 

(c) Show that b?xx, — a’yy; = a’b? is an equation of the tangent line. 


Consider a point (x1, y1) on the hyperbola ax? — by? + cx + dy +e = 0 
with ab > 0. 


(a) Find the slope of the tangent to the graph at (11, 1). 
(b) Write an equation of the tangent line in (a). 


(c) Show that axxi — byy: + $cx + x) + $d(y + y) +e = 0 is an 
equation of the tangent line. 


Show that the product of the distances of a point on the hyperbola xy = —12 
to its asymptotes is a constant. 


If the difference of the distances of the point (x, y) from two foci is zero, show 
that the locus of (x, y) is the perpendicular bisector of the line segment joining 
the foci. 


Describe and sketch the graph of each of the following equations. If the graph 
is a circle, give its center and radius. If the graph is a parabola, give its focus, 
directrix, vertex, and axis. If the graph is an ellipse, give its center, foci, 
directrices, eccentricity, and length of major and minor axes. If the graph is an 
hyperbola, give its center, foci, directrices, eccentricity, asymptotes, vertices, 
and length of transverse axis. 


(a) x? + y?> + 6x + 4y = 12 


(b),.x° + 4% + 6x + 4y + G0 
(c) x +6x+4y4+2=0 
(d) x* — 4+ 6x +4y +4=0 


(e) 47+ 6x + 4y + 13 = 0 
(f) xy + 6x + 4y = 3 

(g) 3x? + 3y? + 6x — 18y = 162 
(h) 4y? — x7 + 6x + 4y = Il 
Gig ais Sern peo 

(i) 7 i= fy ees 


CHAPTER 4 


Integration 


There are two major topics in calculus, differentiation and integration. 
The theory of integration at first appears to have little connection with 
differentiation. However, we shall see that the two processes are indeed 
closely related. 

Differentiation deals with derivatives of real-valued functions, integra- 
tion with integrals of real-valued functions defined on closed intervals. In 
this chapter we shall define and study the definite integral from a to 6 of a 
real-valued function f, which is denoted by [- f. The integral has countless 
applications both in mathematics and the sciences. One of the most important 
enables us to find the area of a region more involved than those studied in 
plane geometry. If f(x) > 0 for every x in [a, 6], then we shall see that is 7% 
equal to the area of the region lying above the x-axis, below the graph of 2 
and between the vertical lines x = a and x = b (see Figure 1). Thus the 
areas of many irregularly shaped regions can be found by integration. 


b 


Figure | 


Another interesting application enables us to find the distance traveled 
by a particle which moves in a straight line. If the particle moves with a 
velocity equal to u(f) at time ¢, then the distance traveled during the interval 
of time from t = a to t = bis given by the integral re lv}. 

It will be useful to recall some of the elementary ideas and notations of 
set theory. In particular, the union of two sets P and Q, denoted by P U Q, 
is the set of elements that belong either to P, or to Q, or to both. The inter- 
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section of P and Q, denoted by P - Q, is the set of elements that belong to 
both P and Q. If P and @Q have no points in common, they are said to be 
disjoint. The set which contains no elements is called the empty set and will be 
denoted by @. Thus P and Q are disjoint if and only if PQ = @. Finally, 
the difference, P — Q, is the set of all elements of P which do not belong to Q. 


1. The Definite Integral. In defining the definite integral, we shall use the 
concepts of bounded sets discussed in Section 1 of Chapter 1. Recall that a 
number uw is said to be an upper bound of a set S of real numbers if the in- 
equality x < wis satisfied for every number x in S. Thus the numbers 100, 5, 
and | are all upper bounds of the closed interval [0,1]. The Least Upper 
Bound Property (see page 7) states that every nonempty set of real numbers 
which has an upper bound has a least upper bound. For example, the number 
1 is obviously the least upper bound of the interval [0, 1]. Note that 1 is also 
the least upper bound of the open interval (0, 1). 

In the same way, a number / is called a lower bound of S provided /] < x 
for every x in S. The Greatest Lower Bound Property (see Problem 11, 
page 9) similarly asserts that if S is nonempty and has a lower bound, then 
it has a greatest lower bound. Finally, a set is simply said to be bounded if it 
has both an upper bound and a lower bound. 

The notion of boundedness can be applied to functions. Specifically, a 
real-valued function f of a real variable is said to be bounded on an interval J 
if the following two conditions are satisfied: 


(1) J1is a subset of the domain of f. 
(1) There exists a real number k such that | f(x)| < k, for every x in J. 


The reader should be able to supply the straightforward argument which 
shows that condition (ii) is equivalent to the assertion that the set S of all 
real numbers f(x) for which x is in J is a bounded set. To illustrate the 
terminology, consider the two functions f and g defined by f(x) = x2 and 


l 
g(x) = —. The former is bounded on both the closed interval [0, 1] and the 
x 


open interval (0, 1), whereas the latter is bounded on neither. 

Let [a, 6] be a closed interval, and let o@ = {x,..., Xn} be a finite 
subset of [a, b] which contains the endpoints aand b. The set o subdivides, or 
partitions, the interval into subintervals, and, for this reason, we shall call it a 
partition of [a, b]. 

Let f be a function which is bounded on the closed interval [a, b], and 
leto = {X,..., Xn} be a partition of [a, b] in which 


@ = Xy hy Xe Se ee 


Since fis bounded on the entire interval [a, 5], it is certainly bounded on each 
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subinterval. (x29, 34]. for = Ays..5 n- Hence the:set S consisting of all 
numbers f(x) with x in [x;_1, x;] has a least upper bound, which we denote by 
M;. Similarly, of course, the set S has a greatest lower bound, which we 
denote by m;. These numbers are illustrated for a typical subinterval in 


a x; x-axis 


Figure 2 


Figure 2. We now define two numbers U, and L,, called, respectively, the 
upper sum and the lower sum of f relative to the partition o by the formulas 


M(x — <a) + M (x2 — 0 a ae 2 M, (Xn ¥ cee 


= m,(X; — Xo) + mo(x2 — X41) $8 + Mn(Xn — Xn—1)- 


Since M,; > m; and since (x; — x;1) > 0, for eachi = 1,...,7n, it follows 
that 


U, — Le (M, = m)(X1 ze Xo) <2 (M, r= M)(X2 sas xi) > 


i (M,, = Mi NK, os, aw) = 0. 
Hence, we conclude that 


| a en 6 C2) 


1 
EXAMPLE 1. Let f be the function defined by f(x) = a ict la, Or = TE, oF 


and consider the partition o = {1, 3, 2,3, 3}. There are four subintervals, 
[1, 2], (8, 21, [2, 3], and [8, 3], and each one is of length 4. It is clear from 
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Figure 3 that the maximum value of f on each subinterval occurs at the left 
endpoint, and the minimum value occurs at the right endpoint. Hence 


M=f=1, m=ff@ = 3, 
M, =f@)= 43, m= f(2) = 3, 
M; ={@) = 2 = f@ = 5, 
M=f/@=% m=fG3)=3 
It follows that 
Ur=VE+E EERSTE 
Ly = PR+E-E4+2-44+8-4 


y-axis 


Figure 3 


Continuing the computation, we obtain 


- 7 
Ue = Z(1 +3454 2) 3 OtMEB TD _T, 


2 2 2 30 ~— 60 
12} pds) At Wer 5 2+ 10 ee 
b= 5(54+3 +545) -5 tes ~ 60 


for the values of the upper and lower sums of f relative to oc. 


In the paragraph preceding Example |, it is proved that, for a given par- 
tition a, the lower sum is less than or equal to the upper sum. We shall now 
prove the much stronger fact that all the lower sums are less than or equal to 
all the upper sums. More precisely, 


(1.1) Let f be bounded on [a,b]. If o and Tr are any two partitions of [a, b], 
hen LAS v,. 
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Proof. The argument will be divided into three parts. 


(i) If r is obtained from o by adjoining just one new number y, then 
L, SiS ie ve 


ieee = 2 a ON Ge = Xp Sy SS SX, = B, and let the 
upper and lower sums U, and L, be defined as in (1), page 165. We shall 
assume that y lies in the Ath subinterval [x;_1, x,]. Then 


tae a ee ae re oe 
and 
eg  S  . 


Abbreviating the expressions least upper bound and greatest lower bound by 
l.u.b. and g.l.b., respectively, let 


Mi. = 1.u.b. of the set of all numbers f(x) with x,-1 < x < y, 


Miz’ 


l.u.b. of the set of all numbers f(x) with y < x°< x, 


g.l.b. of the set of all numbers f(x) with x,-1 <x < y, 


mi, 


mi, = g.l.b. of the set of all numbers f(x) with y < x < xz. 


These numbers are illustrated in Figure 4. The key idea in the entire proof is 
the fact that the least upper bound of fon any set S is greater than or equal to 


a y Xe x-axis 


Figure 4 


its least upper bound on any subset of S, and, similarly, the greatest lower 
bound of fon S is less than or equal to its greatest lower bound on a subset 
of S. This means that 


M, > Mi, and M, > Me, 


my < mM, and m, < mi, 
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as is borne out by Figure 4. Hence 


Mi (xn — Xe—-1) = Milxe — y) + Mi(y — xz-1) 


Mixx — y) + Miy — xz—1), 


V 


mi(X~ — Y) + mel(y — X¢—1) 
< me (xe — y) + mi(y — Xx-1). 


| 


my (Xf = Kia 


But the other terms in the upper and lower sums are the same for the two 
partitions. We conclude that U, > U, and that L, < L,. Since it follows 
from (2) that L, < U,, we obtain 


Le ene ole (3) 
(ii) If o is a subset of r, then the preceding inequalities (3) are still satisfied. 


This follows by repeated applications of part (i), since the partition 7 
can be obtained from o by adjoining one number at a time. 


(ii) If o and tr are any two partitions, then L, < U,. 


Both o and 7 are subsets of the partition o U 7 consisting of their union. 
Hence, one application of part (ii) gives L; < Loy,, and another application 
yields U,y, < U,. Combining these inequalities with L,y,< U.y;, we 
obtain 

ie < Leis < Usiir s Os; 


and the proof of (1.1) is complete. 


Theorem (1.1) states that, for a given function f bounded on an interval 
[a, b], if we consider all partitions of [a, b], then every lower sum is less than 
or equal to every upper sum. It is instructive to picture the relative positions 
of these numbers on the real line. If we indicate each lower sum by a right- 
hand parenthesis, “‘)”, and each upper sum by a left-hand parenthesis, ‘‘(’’, 
the situation looks as shown in Figure 5 (except that in general there are 


upper sums 
"7 SVS XW ———_ 
real line 
TY 4 
lower sums 
Figure 5 


infinitely many sums of both kinds). The question naturally arises as to the 
existence of numbers in between the two sets, and this brings us to the 
definitions of integrability and of the definite integral: Let the function f be 
bounded on the closed interval [a,b]. Then f is said to be integrable over 
[a, 5] if there exists one and only one number J such that 


L,<J< U,, (4) 


sec. 1] THE DEFINITE INTEGRAL 169 


for any two partitions o and 7 of [a, b]. If fis integrable over [a, b], then the 
unique number J is called the definite integral of f from a to b, and is denoted 


by { f. Thats, : 
ae 


Almost all the functions encountered in a first course in calculus are 
integrable over the closed intervals on which they are bounded. The reason 
is that for these functions the differences between the upper and lower sums 
can be made arbitrarily small by taking partitions which subdivide the in- 
terval into smaller and smaller subintervals. Many conditions which ensure 
that a function is integrable are known. Among these, we shall consider two 
[see Theorems (3.3) and (5.1)]. The second condition is continuity. We shall 
see that if f is continuous at every point of a closed interval [a, 5], then f is 
integrable over [a, 5]. 

If fis a function bounded on [a, b], there are, according to the definition, 
two conditions which must be satisfied for f to be integrable over [a, b]. The 
first is that there must exist a number J such that the inequalities (4) hold 
for all partitions o and 7 of the interval. The second is that there must be 
only one such number. It is not hard to prove that the first condition is 
always satisfied (see Problem 9 at the end of this section). It is the second 
which may fail, as the following example illustrates. Let f be the function 


defined by 
if x is rational, 


if x is irrational. 


f=) 


This function is bounded on the interval [0, 1]. However, if o is any partition 
whatever of [0, 1], it is easy to see that U, = 1 and L, = 0. This means that 
every number J between, and including, 0 and 1 will satisfy (4). The function 
is therefore not integrable because J is not unique. 


l 
EXAMPLE 2. Assuming that the function f defined by f(x) = — is integrable 
over the interval [1, 3], prove that P 


3 
ws | £< 


This is the function and interval described in Example 1. For the partition 
o = {1,2, 2,5, 3}, we saw that L, = 34 and U, = %; hence the integral is 
bounded by these two numbers. 


EXAMPLE 3. Consider the function f defined by f(x) = x°. Assuming that f 
is integrable over the interval [0, 1], show that 


i 
asc] fs 


oy 
We) at 
on 
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We use the partition o = {0, 3, 2, 2, 4, 1} and compute the upper and lower 
l 


sums U, and L,. The points of the partition are given by x; = 5? for 
} =-Q, -.., 35. Hence 
oe ee fers =, 2, 5. 


Since in this case the subintervals are all of length 4, equations (1) can be 
simplified to read 


U, = Mi + +++ + My), 
Le = (m+ --- +m). 


It is clear from Figure 6 that the maximum value of f on each subinterval 
[x;_1, X;] occurs at the right endpoint, and the minimum value occurs at the 


Figure 6 


left endpoint. Hence M; = f(x;) and m; = f(x;_1). Since f(x) = x2, we 
have 


eee 
Dae 2 36, 5 a : 
S Sess see 5 eer ii 
ee oe ee 


25 
Thus 


Pius 2. 2 ee. 
= 3(Et+e tet K+) 
ie sieml: fee eee, oe 
Le = 5 (55 +35 + 35 +35 + 35): 
Since 1? + 2? + 32-4? 4 52 = 55 and 0?4+ 1° 4+ 2? 4 92 en 
it follows that 


go ee AE 
Us = $33 = a5, 

o- 2 BOL 5. 
L, = #3 a ae 


This establishes the desired bounds, since L, < ff < U,. 
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An alternative notation for the integral, which we shall use interchange- 
ably with - fv is . f(x) dx. This is the traditional way of writing the integral, 
and its usefulness will become increasingly apparent as we go on. In a later 
section we shall show how the dx which appears to the right of the integral 
sign may be interpreted as a differential. At present, however, it is important 
to realize that dx is only a part of the notation for the integral. The variable x 
which occurs in ip f(x) dx is often called a dummy variable. This name serves 
as a reminder of the fact that the value of the integral depends only on the 
function f and the numbers a and b. Its value is not determined by giving a 
value of x. Thus 


b b b b 
| r-[ roa- | row- | soa-eee. 


and x, y, and t each occurs as a dummy variable. Thus the inequalities es- 
tablished in Examples 2 and 3 can alternatively be written : 


Figure 7 


Although we shall not give a definition of area in this book, the basic 
properties of area can be used to establish its connection with the definite 
integral. Let the area of a set P be denoted by area (P). Two basic properties 
are: 


(1.2) The area of a set is never negative: area(P) > 0. 
(1.3) Jf P is a subset of Q, then area(P) < area(Q). 


In addition, we shall assume the elementary facts about the areas of 
rectangles. 
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Let f be a function which is integrable over the interval [a, b], and which 
also satisfies the inequality f(x) > 0 for every x in [a, b]. Let P be the region 
under the curve. That is, P is the set of all points (x, y) in the plane such that 
a<x<band0 <p < f(x) (see Figure 7). Next, consider two partitions 
o and 7 of [a,b]. The lower sum L, is the area of the union of rectangles 
contained in P; hence, by (1.3) we conclude that L, < area (P). Conversely, 
P is a subset of the union of rectangles the sum of whose area is the upper 
sum U,. Hence area (P) < U,, and we have shown that 


Lz \ wear) <2. 


for any two partitions o and 7. But the integrability of f asserts that the 
definite integral re f(x) dx is the only number with this property. Thus we 
have proved 


b 
(1.4) | f(x) dx = area(P), if f(x) = O for every x in [a, b]. 


EXAMPLE 4. Assuming the formula for the area of a circle and the integrability 
of the function \/1 — x? over the interval [0, 1], compute 


1 
| V/1 — x2 dx. 
0 


The graph of the function \/1 — x? is the upper half of the circle x2 + y? = 1 
shown in Figure 8. It follows from (2.4) that the integral in question is equal 


Figure 8 


to one fourth of the area of the circle. Hence 


u 
[5 sci ears UG 
[ viswax = 2. 
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PROBLEMS 


; 1 ' 
1. Draw the graph of the function f defined by f(x) = — , and answer the following 
questions. = 


(a) Is f bounded on the closed interval [2, 5]? 
(b) Is f bounded on the open interval (2, 5)? 
(c) Does fhave an upper bound on the interval (0, 2)? If so, give one. 


(d) Does fhave a lower bound on the interval (0, 2)? If so, give one. 


2. If the number M is the least upper bound of the set of all numbers f(x) for x 
lying in an interval J, we say simply that M is the least upper bound of f on J. 
A similar remark holds for the greatest lower bound. Draw the graph of the 


1 ' 
function f defined by f(x) = 5 and answer the following questions. 
aXe — 


(a) What is the least upper bound of f on the closed interval [2,3]? 
(b) What is the greatest lower bound of fon [2, 3]? 


(c) What are the least upper bound and greatest lower bound of f on the open 
interval (2, 3)? 


(d) What is the greatest lower bound of fon the interval (1, 2)? 


3. Compute the upper and lower sums U, and L, in each of the following examples. 


1 
(a) f(x) a ee [a, b] ae [1, 4], and ¢ = 1h zs a. 4}. 
xX 
(b) f(x) = 57 la.) = [0,2], and c = {0, 4, 2, 1, 4, &, 2}. 
(c) g(x) sy Po a 3 [a, b] = [0, rh; and lea {X0, X1, X2, X3, X45 ae where 
ty = 9 =O... 5. 


ke 
(d) g(x) oa <, [a, b] jt cf, 1], and ao fos ir, —3, 0 9 - t. 


4. True or false, and give your reason: If a function fis continuous at every x ina 
closed interval [a, b], then f has both a least upper bound and a greatest lower 
bound on [a, 5]. 

5. Assume that the function f defined by f(x) = x? + 1 is integrable over the in- 
terval [0, 1]. Using the partition o = {0, 4, 2, 2, , 1}, show that 


6. Assuming that the function g defined by g(x) = 2x is integrable over the interval 
[0, 2], use the partition « = {0, 3, 1, 3, 2} to show that 


2 
a< | 28 OX 3D. 
0 
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7. Assume that the function x? is integrable over the interval [0,1]. Using the 


PatithiOlics =i, nnn Xn ts WOSCTEIES 1 aa » for.0, 5. «_tat ee 
that 10 


8. Compute the definite integral £ i= fe jijac = £ f(t) dt in each of the 
following examples. Assume that f is integrable, and use Theorem (1.4) and 
the standard formulas for the areas of simple plane figures. In each case, 
draw the graph of fand shade the region P. 


ii 1 
(a) ? Jf, where f(x) = 4/1 — x2. (d) | (5 — 2x) dx 
—1 0 


2 1 
(b) | f(t) dt, where f(t) = t — 1. (e) | |x| dx. 
1 —1 


(c) [ 2xax 
0 


9. It is stated on page 169 that the first condition for integrability is always satisfied: 
If fis bounded on [a, 5], then there exists a real number J such that L, < J < U- 
for any two partitions o and 7 of [a, 5]. 


(a) Show that one such number is the least upper bound of all the lower sums 
L,. (This number is called the lower integral of ffrom a to b.) 


(b) Show that another possibility is the greatest lower bound of all the upper 
sums U,. (This number is the upper integral of f from a to b.) 


(c) Show that fis integrable over [a, b] if and only if the lower integral from a 
to b equals the upper integral, and that if the lower integral equals the upper 
then their common value is }, f. 


2. Sequences and Summations. We shall return to the definite integral in 
Section 3. The purpose of the present digression is to develop some tech- 
niques, applicable not only to the study of the integral but also to many other 
parts of mathematics. 

Most of the functions studied in this book have as domains intervals on 


: ; ; an 
the real line, or unions of intervals; e.g., the domain of the function — is the 
* 


union (— «, 0) U (0, ~). In this section, on the other hand, we are concerned 
with functions whose domains are sets of integers. An example is the function 
a defined by a(n) = V/n — 2, for every integer greater than 1. If ais a func- 
tion whose domain Is a subset of the integers, it is common practice to denote 
its value at n by a,. Thus 

Gy, = WG. 
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A simple example in which the domain is a finite set of integers is a partition 
of an interval in which we have indexed the points of the partition as 
XG; <3 3 5 Nps Sa, 


a= XG), for 7 = 1k. 8 


We come next to the definition of a sequence, which is a special case of a 
function defined on a set of integers. We shall accept the intuitive idea of a 
sequence to be that of a list (in mathematics, most likely, a list of numbers). 
With this in mind, we define a sequence to be a function whose domain Disa 
set of integers such that 


(i) Disa set of consecutive integers; i.e., if 7 andj are in D, then every 
integer between / and / 1s also in D. 
(ii) D contains a least element. 


If s is a sequence and if / is the least, or smallest, integer in its domain, 
then s(/) = s; is the first member of the sequence, s(/ +-1) = sj11 is the 
second member, and so on. In the most common applications / is either 0 or 
1, and so the values of the sequence appear as either So, 51, So,... Or as 
Sig Sis ae 

A sequence is finite or infinite according as its domain D is finite or in- 
finite. Note that the range of an infinite sequence need not contain infinitely 
many numbers. The function a defined, for every positive integer n, by 


0, if 1 is even, 
os  tiaoda. oe 


is the infinite sequence 1, 0, 1, 0, 1,0, 1,.... Aneven simpler example of an 
infinite sequence is the constant function b defined by 


ae 2 for every positive integer n. 


A common notation for a sequence s, whether finite or infinite, is {s,,}. When 
a sequence is written in this way, the letter n is called an index. Like the vari- 
able of integration in a definite integral, it is a dummy symbol. Any letter 
can be used, although n, m, i, 7, and k are the most common. Thus 


i = fs) = 1335 = ete. 


Of course, a finite sequence can be described by simply enumerating its terms, 
C85 81, «2 5 ys OF ay Signs os 5 10, 

We shall study two major topics in this section. The first is the limit of 
an infinite sequence. This is actually just an application of the idea of the 
limit of a function which we defined and studied in Chapter 1. As an example, 
let s be the infinite sequence defined by 

2n +n— 1 


= ees for every positive integer n. 
oe ae yP 
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We ask for the limit of {s,,} as m increases without bound, which we denote by 
lim s,. Dividing both numerator and denominator of the above expression 


no 


by n’, we obtain 


1 
If n is very large, it is clear that 2 + — — — is nearly equal to 2, and that 
) non 
3 — — + -; 1s nearly equal to 3. We conclude that the number which the 
aes | 


values of the sequence are approaching, i.e., the limit, is 4. Thus we write 


—  ) ee 
Hi te i EO ger egg 3: 


EXAMPLE |. Let {s,,} and {a,,} be two infinite sequences defined, respectively, 
by 


> — oe at fie 4, Sc 
/ = 2 
y 
Fidget Bit (oo a oes eee 
m 


Find lim s, and lim a,. For the sequence s, we divide numerator and 


no m—® 


denominator by +/n, getting 


yavees ype 


S erence ne eterna 
——+/5n — 2 Eas 
As RE 


n 
5 2 et ; 
Both — and — obviously approach 0 as a limit as n increases without bound. 
n n 
We conclude that 


lita ij. =i St 


Va of: 
ae oo 


5 


For the sequence {a,,} we have 


te 
Af! 


: I 
It is obvious that, as m increases without bound, so does m + —. Hence no 
m 


limit exists. On the other hand, we can unambiguously express the fact that 
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the values of the sequence are increasing without bound by writing 


2 
fae = toe mr ingots 


m—@ m—@ 


As we have remarked, the definition of the limit of a sequence is in- 
cluded in the definition of the limit of a function. For emphasis, however, 
we shall give it in this special case. Let s be an infinite sequence of real 
numbers. Then the limit as 7 increases without bound of s,, is equal to 5, written 

fo ss, = 2 

nas 
if, for any € > O, there exists an integer m in the domain of s such that when- 
ever n > m, then |s, — b| < €. The definition can be phrased geometrically 
as follows: The limit of {s,,} is b if, given an arbitrary open interval (b — e, 
b + e€), all the numbers s,,, from some integer on, lie in that interval. Thus for 
the oscillating sequence 1, 0, 1, 0, 1, 0, .. . defined in (1), no limit exists. 

The second topic in the section is the study of a convenient notation for 
the sum of a finite number of consecutive terms of a sequence. Let a be a 
sequence (finite or infinite) of real numbers. If m and 7 are in the domain of 
the sequence, and if m < n, then the sum a, + GQn4i + °°** + Gy is called a 
series and is abbreviated >>*_,, a;. Thus 


nN 
YD) Gg = Om Ami + AF Gn. 
=m 

We call >°"_,, a; the summation of {a;} from m to n. 


EXAMPLE 2. Let {a,' be the sequence defined by a; = i’, for every positive 
integer i, Then 


w=l1 


5 
Ya = P= P4+V47R44F 45 = 55. 
w=1 

Another series defined from the same sequence 1s 


6 6 
Ya = DP HaPH+HF4H+T+4+6 = 86. 
On the other hand, we might be interested in the sum of the squares of the 
first n integers for an arbitrary positive integer n. This would be the series 


SL Se eer 
tonk 


t=1 


The symbol 7 which appears in the series }/7_, a; is called the summa- 
tion index. It, too, is a dummy symbol, since the value of the series does not 
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depend on 7. Like the definite integral, >°7_,, a; depends on three things: 
the sequence a (the function) and the two integers m and n. Thus 


n 


Sa Ya 5 cy ay faye 
k=m 


=m 7=m 


EXAMPLE 3. Using the summation notation, write a series for the sum of all 
the odd integers from 11 to 101. An arbitrary odd integer can be written in 
the form 27 + 1 for some integer 7. It is not hard to see, therefore, that one 
answer to the problem is given by the series 


Ss a 


Another is the series 


ss G7 = 2). 


It should be emphasized that the summation notation offers no new 
mathematical theory. It is merely a convenient shorthand for writing sums 
and manipulating them. The ability to manipulate comes from practice, but 
the techniques are based on the following properties: 


(2.1) e (a; + b;) = > =F 23 bj. 


=m 


(2.2) 3 Ca; = C 3 a;. 


(2.3) . Cc 


c(n — m+ 1). 


Proof. The proofs are very simple. For (2.1) we have 


> (a; i bi) = (am + bm) = Gat Bs ren, S eee. = (an + bn) 
= (Gm F G@mx1  ** > + On) + On 2 ee 


=m 
n 


= at 


For (2.2), 
iS CO5-= Can > COmat Og 
= Clam + Qm4i1 + °°* + Gn) 


n 


C > aj. 


1=m 
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To prove (2.3), one must understand that >-7_,c¢ means > /j_ma:, where 
fa;\ is the constant sequence defined by a; = c. Hence 


n— m-+1terms 


nm n 

Do = Dy 4 = Gn F ang FF 
S¢4 C++ Se 
= c(n — m+ 1). 


This completes the proof. 
There are two other summation identities which are useful and which 


we shall include. They are the formulas for the sum of the first 1 positive 
integers and for the sum of the squares of the first ” positive integers: 


(2.4) 


(2.5) 6 


Proof. There is a very clever proof of (2.4), which the great mathematician Carl 
Friedrich Gauss (1777-1855) is said to have figured out for himself in a few 
seconds in his first arithmetic class at theage of 10. Writethe sum S = 07.11, 
once in natural order and, underneath it, the sum in reverse order as follows: 


Dee eae ee Ae Lem 
S=nt@—P +e +241. 


If each number on the right side of the first equation is added to the number 
directly beneath it, the sum is + 1. Hence the sum of the two right sides is a 
series consisting of m terms each equal to n + 1. It follows that 


2S = n(n + 1), 


from which (2.4) is an immediate corollary. 


Formula (2.5) is probably most easily proved by induction onn. The 
proof is straightforward, and we omit it. 


EXAMPLE 4. Evaluate 


@) >> Gi-+ 585 2, 


(b) ss 3i + 5i— 2) 
i=1 n3 
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Using the various properties of summation, we obtain for (a), 


Se ee AR + Sim 25 
tal 1 tea 


g Mat ene) set 
Sa a oe An 
= z 2 2 

3 #4 
= oe sn’ 4? kn, 


Part (b) is really a trivial modification of (a). The number n* which appears 
in the denominator is the same for each term in the sum, i.e., it is a constant, 
and can be factored out immediately. Thus 


2 seg eas 2 - TS eg 5) 
i=l 
Hence, using the answer from (a), we get 
3 Se = (n" + Aas +n) 
= oe ° “< = 


We conclude the section with an example which combines the summation 
convention with the limit of an infinite sequence, 


EXAMPLE 5. For every positive integer n, let S, be defined by 


n 2 
i +2 
ee 
‘ bat 
The numbers Sj, S, S3;,... form an infinite sequence, and the problem is to 
evaluate lim S,. Using the properties of summation, we obtain 


no 


Se 


=: A hes ee D + an 
i= Hee us) 


2 3h & Li? 
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Hence 


a, = (e- 


No no 6n3 
I l 13 l 
= lim (5+ 35 + ga) sf 


which is the answer to the problem. Frequently the notations are com- 
pounded; i.e., we write 


im 
n—D F 1 
PROBLEMS 
1. For each of the following sequences {s,', compute lim s,, if the limit exists. 
1 
cS ie ee ae - fOr 7 233, 2. 
n—1 
2n’ —3n+1 
(is = — ae. -tet B= 4523 5s 
1 ae 
2k, = ba oy for every positive integer 7. 
n— 2 ro ae 
3, = — >for every positive integer 7. 


nN 


2. Let sequences {a;\, {b;}, and {s,} be defined by 


( = 
Q= 1, 
| b; cs fj ee Es 
s ee 
oe Paes 
Evaluate 
4 4 oa 
(a) a; (d) a 
D 3 
o 2% (e) dos: 


j=—2 i=l 


(©) SS Qa; + 5b;) (f) So ashy. 


j=l j=0 
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3. Compute 
5 4 j 
@) >) @?-31+4) (d) ae 
(b) DIG +1)" - 7 jet a> x’. 
j=1 k=0 


(c) ¥ x 
k=0 


4. For any sequence ao, .. . , dn, show that >-%_1 (a; — a,—1) depends only on the 
first and last terms. 
5. Using the various properties of summation, evaluate 


n ‘ n oi i 

a > Gres ie Me : 
eal | a 

oH =a i © Cat 
= i=0 


Cea 
ten 


6. If f(x) = x? — x + 1, find 


(a) = fi) (c) ‘= fi) 
(b) > £(4) ay (4). 
1=1 1=0 


7. Compute lim S,, if the limit exists, for each of the following sequences. 


nO 


a soe 
(a) 5, = > ——,— > for every positive integer n. 
ti . 


“. 2i- 2 on 
ib) 5, = 2 = > for every positive integer n. 


ey B= n= 1,2,..., 
1=!1 
oa ce 
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8. Evaluate 
i _ wi6i — 2i+1 
© lin Of 0) tin SRB 


i=1 


9. Prove (2.5) by induction on n. 
10. Using the identity 


prove that 


Verify this result directly for 7 = 1, 2, 3. 

11. (a) How many presents did I receive from my true love on the 12th day of 
Christmas, when she gave me “12 drummers drumming, 11 pipers 
piping, ..., and a partridge in a pear tree”? 

(b) How many presents did I receive during the entire 12 days of Christmas? 
(Some familiarity with the words of the song is required.) 


3. Integrability of Monotonic Functions. Let f be a given function bounded 
on a closed interval [a, b]. How do we know whether or not f is integrable 
over [a, b], i.e., whether or not i f exists? In this section we shall give a 
partial answer, and also compute some integrals. Note that there is one 
situation where we know the answer immediately: If a = 6, then all upper 
and lower sums are equal to zero. Hence f is integrable, and 


[ r- [ toa-o 


So we now assume that a < b. For every positive integer n, we shall 
denote by a, the partition which subdivides [a, b] into n subintervals each of 


(3.1) 


—a 
length (AOS Bia i Xow dma Xehs, WNEKE 
b—a\. ; 
n= at (2= e Coe 0, ag FE 
n 
Moreover, 
—a 
4. Xi? bak Jn 
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The upper and lower sums of f relative to o,, will be denoted simply U,, and 
L,, respectively. That is, we abbreviate U,, by U,, and in the same way 
L,, by Ly. One criterion for integrability is expressed in the following 
theorem. 


(3.2) If lim (U,, — L,) = 0, then f is integrable over [a, b] and 


neo 


b 
lim Ly = tim 0, = i f(x) dx. 


no n—-@ 


Proof. We recall. the basic theorem of Section 1—that the upper and lower sums of 
f relative to any two partitions o and 7 of the interval [a, 5] satisfy the in- 
equality L, < U,. This implies, in particular, that any upper sum U, is an 
upper bound of the set L of all lower sums L,. Hence, by the Least Upper 
Bound Property, the set L has a least upper bound which we denote by J. 
Since this number J is an upper bound of L, we know that L, < J for every 
partition o. Furthermore, since J is a /east upper bound, we have J < U, 
for every partition r. Thus 


Lz JS U,, 


for all partitions o and 7 of [a, b]. As a special case of these inequalities, we 
have 

Gee eee SP for every positive integer 7. ee 
Since by hypothesis lira (U, — Ln) = 0, it follows that this number J is the 


n—-+ 


only number which can lie between all upper and lower sums. Hence, by the 
definition, fis integrable over [a, b] and J = £ f(x) dx. From (1) we obtain 
the inequalities 


0=< i =— Ls UU, LL for every positive integer 7. 


Since the right side of the above inequalities approaches zero, the expression 
in the middle is caught in a squeeze and must also approach zero. Hence 
lim (J — L,) = 0, or, equivalently, 


no 


b 
lea. je = | f(x) dx. 


n+ 


Finally, consider the identity U, = J + (U, — Ln») — (J — Ln). Since the 
two expressions in parentheses approach zero, it follows that 


and the proof is complete. 


An important class of functions to which the preceding theorem can be 
readily applied, and which we now define, is the class of monotonic functions. 
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To begin with, a real-valued function fis said to be increasing on an interval 
Tif the domain of f contains J as a subset and if, for every x; and x, in J, 


xX, <= x2 implies f(x) < f(%). ee 


If (2) holds for every x; and x, 1n the entire domain of f, we say simply that f 
is an increasing function. Companion definitions are obtained by simul- 
taneously replacing the second inequality in (2) by f(x) > f(x») and the 
word increasing by the word decreasing. For example, the function f defined 
by f(x) = x is increasing on the interval [0, «) and decreasing on the 
interval (— «, 0]. The function g defined by g(x) = —2x + 1isa decreasing 
function. 

Note that, according to our definition, a constant function is both in- 
creasing and decreasing. Thus “‘increasing,”’ as it is used here, literally means 
“nondecreasing,” and in the same way ‘“‘decreasing” means “‘nonincreasing.” 

A monotonic function is one which is either increasing or decreasing. 
Similarly, a function is monotonic on an interval if it is either increasing or 
decreasing on the interval. For such functions it is not difficult to prove the 
following integrability theorem. 


(3.3) THEOREM. [If the function f is monotonic on the closed interval [a, 5], 
then f is integrable over [a, b]. Specifically, lim (U, — Ly) = 0. 


no 


Proof. For the sake of concreteness, we shall assume that fis increasing on [a, 5]. 
An analogous argument works if fis decreasing. By far the best proof of this 
theorem is obtained from a picture, which provides a completely convincing 


y-axis 


y=f(x) 


f(6)— f(@) 


Figure 9 


argument. A typical example of an increasing function together with a 
partition of the interval is shown in Figure 9(a). The difference U, — Ly 
is equal to the sum of the areas of the shaded rectangles. By sliding these 
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rectangles under one another to form a single stack, we obtain the tall 
rectangle shown in Figure 9(b), whose area is also equal to U, — Ly. This 
b-—a 


rectangle has base and altitude f(b) — f(a). Its area is the product of 


these, and so 

b—a 
This difference can be made arbitrarily small by taking n sufficiently large. It 
follows that lim (U, — L,) = 0, and we conclude from (3.2) that fis inte- 


nO 


grable over [a, b]. This completes the proof. 


EXAMPLE 1. Evaluate iff x’ dx. The function f to be integrated is defined by 
f(x) = x’, and the interval of integration is [0, 2]. Since fis increasing on the 


interval, the integral certainly exists. The partition o, = {xo,..., Xn} 
which subdivides [0, 2] into n subintervals of equal length is given by 
wins y Aca Ag pens sh 2i 
n n n 


for each i = 0,...,”. Moreover, 


It follows from Theorems (3.2) and (3.3) that 


2 
| x’ dx = lim U, = lim Ly. 
0 


no no 


That is, we may compute the integral using either the lower or the upper 
sums. Choosing the latter, we observe from Figure 10 that, on each sub- 
interval [x;_1, x;], the function f has its maximum value at the right endpoint, 
t.e., at x; Henee 


M,; = f(x), 2g eee 


21 4 
Since f(x;) = x3 and since x; = —, it follows that M, = x3 = —. Sub- 
A n 


u 


stituting in the formula for the upper sum, 


ite — >> Mx; ae #53), 


4I=l1 


we obtain 
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From (2.5), we have 


Sot = Mat Qn + 1) _ 2a! + 3a 


ie 6 6 

Hence 
3 2 
ee 4, FS . 
ns 6 3 nh n2 

and so 

| 3 l 4 8 

i Uy = tim $(2 +342) a 


We conclude that 


x-axis 


Figure 10 
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It was shown in Section | that the integral of a nonnegative function is 
equal to the area under its graph. It follows from the above example that the 
area of the region bounded by the parabola y = x’, the x-axis, and the line 


x = 2 is equal to 4. 


EXAMPLE 2. Evaluate - (5 — x) dx. The function f, defined by f(x) = 5 — x, 
is linear and decreasing on the interval [1, 4]. Its graph is shown in Figure 11. 
The partition o, = {Xo,..., Xn} subdivides the interval [1,4] into subin- 


3 
tervals of length = —, and the points are given by 
n 


mat +(@)i | eee. 
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In addition, 


LS eS SS 9 as 


[CHAP. 4 


We shall compute the integral as a limit of lower sums, and it follows from 


Theorems (3.2) and (3.3) that 


i 


OG — xox = te £,. 
1 


no 


y-axis 


Figure 11 


Since f is decreasing, its minimum value on each subinterval [x;_1, x;] occurs 


at the right endpoint. Hence 
m; = f(x), oe Cee 


3; 
We have x; = 1 + = and f(x;) = 5 — x,, and so 
n 


: 3 
SINCE: iy: SX gto = we Bet 
n 


Ly, = > mx; — X;_1) = 3 (4 — 2) 3. 
t=a1 


Faas 
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The rest of the problem uses the manipulative techniques of the summation 


convention. 
. Sty 3 “/12 9 
i= > (4-#)5- + (2-3) 
$2 _ 9 
= 1=1 s: t=1 n* 
i 9. 
= n ied = I du : 
l 
Siice >, 1° >-# and sitice 5 7, fF = mar” , we get 
Hed! ees 
n n? Zz 


9 l 
ae (1 a. ‘) : 
But it is easy to see that 


: 9 1 9 l 
tim [12-3 (1 +})\|- es 


and we finally conclude that 


zt 


— G — x)dx = lim Ly = 74. 


This answer can be checked by looking at Figure 11. The value of the 
integral is equal to the area of the shaded region P, which is divided by the 
horizontal line y = 1 into two pieces: a right triangle sitting on top of a 
rectangle. The area of the triangle is $(3 - 3) = 3, and that of the rectangle is 
3° Ps3 eeece 


4 
| (5 — x) dx = area(P) = 3+ 3 = 75. 
i 


The excessive lengths of the computations in Examples | and 2 make it 
obvious that some powerful techniques are needed to streamline the process 
of evaluating definite integrals. The advent of modern high-speed computers 
is one answer to the problem, and occasionally, as in Example 2, a simple 
formula for area will do the trick. The classical solution to the problem, 
however, is the Fundamental Theorem of Calculus, which we shall study in 
detail in Section 5. 
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PROBLEMS 


1. Evaluate the following definite integrals by finding the limits of the upper or 
lower sums. 


1 3 
(a) [Pa (c) [wena 


2 1 
(b) | 2x dx (d) | (3x° + 1) dx. 
0 0 


2. For each of the integrals in Problem 1, draw the region whose area is given by 
the integral. 

3. Let f be the step function defined by f(x) = i, if i — 1 < x < i, for every in- 
teger i. Draw the graph of fand compute the following integrals. (Hint: These 
problems are neither hard nor long. They require an understanding of the 
definition of integrability and possibly some ingenuity.) 


He 3 
(a) : 7 (c) | 7 

it —1 

3 ff 
(b) | a (d) : af 


4. Every constant function is both increasing and decreasing. A stronger condition, 
which excludes constant functions, is obtained by defining f to be strictly in- 
creasing if 

x <y imphes _f(x) < #0), 


for every x and y in the domain of f. The companion definitions of what it means 
for a function to be strictly decreasing, strictly increasing on an interval, etc., 
should be obvious. Using the Mean Value Theorem (page 113), prove that if a 
differentiable function f satisfies the inequality f’(x) > 0 for every x in an in- 
terval J, then f is strictly increasing on I. 

5. Prove the converse of Theorem (3.2); i.e., if f is integrable over [a, 5], then 
lim (U, — Ln) = 0. (This is a difficult problem.) 


nD 


4. Properties of the Definite Integral. If a function f/ is integrable over an 
interval [a, b], then in the definite integral 


b b 
[ r=] soa 


the function f is called the integrand, and the numbers a and 56 the limits of 
integration. 

The basic properties of the definite integral are contained in the follow- 
ing five theorems. 
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(4.1) If f(x) = k for every x in the interval [a, b], then 


b b 
[ seoax = | k dx = k(b — a). 


(4.2) The function f is integrable over the intervals [a,b] and [b, c] if and 
only if it is integrable over their union [a, c]. Furthermore, 


b c c 
[ soos | ponas= | pooax 


(4.3) If f and g are integrable over [a, b] and if f(x) < g(x) for every x in 
[a, b], then 


b b 
[ soars | eeoax 


(4.4) If f is integrable over [a, b] and if k is any real number, then the product 
kf is integrable and 


b b 
| kre ax = k| f(x) dx. 


(4.5) If f and g are integrable over [a, b], then so is their sum and 


b b b 
| [f(x) + g(x)] dx = | f(x) dx + | 8(x) dx. 


None of the proofs of these theorems is deep in the sense of requiring 
great ingenuity or any techniques beyond the use of least upper bounds and 
greatest lower bounds. However, they vary considerably in the amount of 
detail required. The proof of (4.1) is a triviality. For if f has the constant 
value k on the interval [a, b], then, for every partition o of [a, b], the upper 
sum U, of f relative to o is equal to k(b — a), and so is the lower sum. Thus 


L, = k(6 — a) = U,, 


which proves both that f is integrable and that the value of the integral is 
k(b — a). 

The proof of (4.3) is slightly more difficult and probably most easily 
obtained by contradiction. Suppose the premise true and the conclusion 
false. That is, we assume that iF n> + g. The definition of integrability 
asserts that if a function is integrable over an interval, then there exist upper 
and lower sums lying arbitrarily close to the definite integral. Therefore, 
since g 1s integrable and since J. g< fe f, there must exist an upper sum for 
g which is less than f f. Specifically, there exists a partition o of [a, b] such 
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that the upper sum of g relative to a, which we shall denote by U,(g), satisfies 


the inequality : : 
| PSUs) < [ I. 


But since f(x) < g(x) for every x in [a, b], the corresponding upper sum of f, 
denoted U,(/), is less than or equal to U,(g). Thus we obtain the inequalities 


5 
U.(f) S U.(g) < | I 


However, every upper sum of f is greater than or equal to the integral { Z. 
Hence we have arrived at a contradiction, and (4.3) is proved. The proofs 
of (4.2) and (4.5) are given in Appendix B, and that of (4.4) is assigned as a 
problem at the end of the section. 

The additivity property of the integral stated in Theorem (4.2) obviously 
extends to any finite number of intervals. Thus if o = {x0,...,X,} is a 
partition of [a, b] with a = x» < x1 < ‘+> < x, = 5, and if f is integrable 
over each subinterval [x;_1, x,;], then by repeated application of (4.2) it 
follows that f is integrable over [a, 5] and that 


b = v; 
| I(x) dx = ss 8 Fix ax. (1) 


In Section 3 it was proved that if a function is monotonic on a closed 
interval, then it is integrable over that interval. Theorem (4.2), as extended 
in equation (1), increases the scope of this result enormously. For although a 
function f may not be monotonic on a given interval [a, b], it is frequently 
possible to partition [a, b] into subintervals on each of which fis monotonic. 
It then follows that f is integrable over the entire interval; 1.e., ib I(x) dx 
exists. 


EXAMPLE |. For every nonnegative integer n and interval [a, b], show that 
the definite integral 
b 
| Sax 
a 


exists. To say that [ 7 x" dx exists is just another way of saying that the func- 
tion f defined by f(x) = x” is integrable over [a, b]. We now prove that this 
is so. For every nonnegative integer n, the function x” is an increasing func- 
tion on the interval [0, «), and it is an increasing or a decreasing function on 
(— «, O] according as n is odd or even. Hence if [a, b] is a subset of [0, 0) 
or a subset of (— «, 0], then the function x” is monotonic on [a, b] and is 
therefore integrable over that interval. The remaining possibility is that 
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a <0 < 5. In this case, x” is integrable over the intervals [a, 0] and [0, 5] 
separately. It then follows that x” is integrable over their union, which is 
[a, b], and the proof is complete. 


Just as Theorem (4.2) was generalized to more than two intervals, 
Theorem (4.5) can be extended to any finite number of functions. Thus if 
each one of the functions fi, ... , f, is integrable over [a, b], then by repeated 
applications of (4.5) it follows that the sum f; + --- + f, is integrable over 
[a, b] and that 


b b b 
[ Ue +--+ poonas= | ponart--+ | Ir{x) dx. (2) 


EXAMPLE 2. Consider an arbitrary polynomial 


P(x) = A,x”" + Ve eae 7 ee it Ay 


and a closed interval a,b}, “Then, for-each+— 0... , #7, we know from 
Example 1 that iB x' dx exists. It follows by (4. 4) thas each function a;x* is 
integrable over [a, b] and that £. a,x dx =.a; ie x’ dx. We conclude from the 
preceding paragraph that the polynomial p(x), which is the sum of the func- 
tions a;x*, is integrable and that 


b _ b 
| p(xydx = a; | x’ dx. 3.) 


As a concrete example of equation (3), consider the polynomial 7x° — 3x? + 
x’ + 3. We have immediately 


b 
| ar ee 


b b b b 
a3) dx — 3 x° dx + vdx +3 1 dx. 


Since we know from (4.1) that f 1 dx = b — a, the last term in the above 
equation can be replaced by 3(b — a). 


Summarizing Examples | and 2, we conclude that all polynomials are 
integrable and that the problem of computing their integrals reduces to the 
problem of computing the integrals of the positive powers of x. 

The interpretation of the definite integral as an area will now be general- 
ized to include functions which may take on negative values. To begin with, 
suppose that a function f is integrable over [a, b] and, in addition, that 
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f(x) < 0 for all x in [a,b]. The graphs of both f and —f are drawn in 
Figure 12. As shown in the figure, we denote by P the region consisting of all 


Figure 12 


points (x, y) such that a < x < band f(x) < y < 0, and, similarly, by Q 
the region defined bya < x < bandO < y < —f(x). It is obvious that 


area(P) = area(Q). 


It follows from Theorem (4.4), by taking k = —1, that the function —f is 
integrable over [a, b] and that 


| . (—f(x)) dx = — | “f(s) dx 


Since —f(x) > 0 for every x in [a, 5], we know that 


b 
| (— f(x)) dx = area(Q). 


Combining the preceding three equations, we conclude that 


b 
| f(x) dx = —area(P). 


Next, we suppose that fis integrable over [a, b] and takes on both positive 
and negative values. Specifically, let fa, b] be partitioned by inequalities 


OE NP NS, lo ae Sth 


so that on each subinterval [x;_1, x;] the function f is either nonnegative or 
nonpositive. We denote by P* the set of all points (x, y) such thata <x <b 
and 0 < y < f(x), and by P the set of all points (x, y) such thata < x < b 
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and f(x) < y < 0 (see Figure 13). It follows from the conclusion of the 
preceding paragraph and from the additivity of the integral, as generalized 
in equation (1), that 


b 
(4.6) | f(x) dx = area(P*) — area(P" ). 


This is the principal geometric interpretation of the integral. 


Figure 13 


EXAMPLE 3. Evaluate is (x? — 3x) dx. The integrand, f(x) = x* — 3x, isan 
an odd function; i.e., the equation f(— x) = —/f(x) is satisfied for every x. Its 
graph, drawn in Figure 14, is therefore symmetric under reflection first about 
the x-axis and then about the y-axis. It follows that the region Pt above the 
x-axis has the same areas as the region P” below it. We conclude that 


2 


ig = 3x) dx = 0. 


—2 


y-axis 


Figure 14 
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The final topic of this section is an extension of the definition of the 
integral. Up to this point, * f(x) dx has been defined only if a < b. It 
turns out to be algebraically more convenient to remove this restriction. 
We do so by decree: If /f is integrable over the interval [a, b], then we now 
define 


a b 
[ pooa=- | pooax (4) 


It is a simple matter to verify that the equations which form the conclu- 
sions of Theorems (4.1), (4.4), and (4.5) remain true, in the light of the 
extended definition of the integral, if a and 6 are interchanged. Thus 


b 
| k dx = k(b — a), 
b -b 
| Kiyo = K| f(x) dx, 


b b b 
| [f(x) + g(x)] dx = | f(x) dx + | g(x) dx, 


are valid equations regardless of whether a < bor b < a. 

On the other hand, if a and 6 are interchanged in the conclusion of 
Theorem (4.3), then the direction of the inequality must be reversed. 

Less trivial to verify, but equally important, is the generalized form of 
(4.2): 


(4.7) If f is integrable over the smallest closed interval which contains the 
numbers a, b,.and c, then 


b € Cc 
[ soyav+ | sonar = | baba en 


The proof is obtained from (4.2) and the definition (4) by simply check- 
ing each of the six possible cases in turn: 


iy €2 bee 
Gi) 2 = 5. 
(iit) beten < c. 
(i) oS eae 
uy 2 = a ee. 


iW) -¢ = bh 


The details are tedious, and we omit them. 
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PROBLEMS 


1. Expand each of the following integrals. That is, write each one as a sum of 
constant multiples of the integrals of the powers of the variables. 


1 3 
@ | o+soas @ | ap®-1wi +40 
0 Dd 
x if 
(b) | ei— ¥ = 2x (e) | Ge & 2Y dx, 
2 0 


2 
(c) | Gr =F 4 Har 
1 


2. Given that hs Crdxy = ra , for every nonnegative integer n, evaluate 
n : 
it il 
(a) | (2x° + 3x) dx (d) | (x + 2 dx 
0 0 
it 1 
) | ox? - 3° — 24s © | @w-r+ne 
0 0 


1 
(c) | Gr = tha 
0 


3. Use the result 


and the analogous result at the beginning of Problem 2 to evaluate 


9 2 
(a) | Gx = 2x Ta (c) | (4x° — 3x + 2adx 
if 0 


(b) | x’ dx (d) | 2 FE edi. 


4. Using the definition of integrability, prove Theorem (4.4). (Suggestion: Treat 
the cases k > 0 and k < 0 separately.) 

5. Using (4.3), prove that if f and g are integrable over [a, b] and if f(x) = g(x), 
for every x in [a, 5], then 


b b 
f(x) dx = | a(x) dx. 
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6. Prove that if fis integrable over [a, b] and if f(x) < M for all x in [a, 6], then 


b 
| f(x) dx < M(b — a). 


7. Replace the symbol * by either < or > so that the resulting expressions are 
correct. Give your reasons. 


1 1 3 3 
(a) | x’ dx * | x’ dx (c) | x’ dx + | x? dx. 
0 0 1 il 


1 1 
(b) x dx + | x’ dx 
=i mal 
8. Plot the graph of the function f(x) = 1 — x’, and indicate the region Pt 
defined by the inequalities 0 < x < 2 and 0 < y < f(x) and the region P— 
defined by the inequality 0 < x < 2 and f(x) < y < 0. 


(a) Use the identities given in Problems 2 and 3 to evaluate the integrals 


fo £00 dx, SP fOd dx, and fy fGd de. 
(b) Find area(Pt), area(P-), and area(P* U P-). 


9. Draw the graph of the function f(x) = x(x — 2)(x — 4) = x*° — 6x° 4+ 8x, 
and indicate the region P* defined by the inequalities 0 < x < 3 and 
0 < y < f@), and the region P- defined by 0 < x < 3 and ja 7.) 
Let P = P+ UP, and suppose that is i) dx = 4 and cs f(x) dx = 23. 
Find area(P*), area(P~), and area(P). 

10. Prove case (ili) of Theorem (4.7). 

11. Consider a function f which is integrable over [a, 5] and which, in addition, 
satisfies: 


(i) fis continuous at every point of [a, 4]. 
(ii) f(x) > 0, for every x in [a, 5). 
(iii) f(c) > 0 for at least one point c in [a, 5]. 
Prove that [. f(x) dx > 0. 


5. The Fundamental Theorem of Calculus. In spite of the fact that we have 
thus far developed a significant amount of the theory of the definite integral, 
the actual evaluation of { I(x) dx, for even a very simple function f, is gener- 
ally an arduous task. For a wide class of functions, the problem of computa- 
tion is solved by a theorem which relates the definite integral to the derivative 
and which has become known as the Fundamental Theorem of Calculus. 
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To understand clearly our presentation of this important result, it is 
necessary to be aware of the following integrability theorem. 


(5.1) THEOREM. [f the function f is continuous at every x in the closed interval 
[a, b], then f is integrable over [a, b]. 


We shall give only a brief outline of the proof, which shows that the 
result is plausible. Since fis continuous, its values do not vary widely over a 
small interval. Recall that the nth upper and lower sums are defined by 

U,, 
Ly 


M(x = Xo) = ee Mi (Xn = ees F 


m(X1 st Xo) = = - Aa Si Xen, 


where M; and m; are, respectively, the maximum and minimum values of f 
on the ith subinterval. We assume that n is large and each subinterval small. 
By the continuity of f, therefore, the difference M; — m; is small for each 
j= 1,...,8. tem tim implies that U, — £15 small. In fact, U, — £, 
can be made arbitrarily small by taking nv sufficiently large; Le., 
lim (U,, — L,) = 0. This limit is sufficient to prove that fis integrable over 


na 


[a, b], as shown in Theorem (3.2). To change this outline into a complete 
proof, it is necessary to introduce the concept of uniform continuity, which 
we shall not do in this book. 

We shall now show how the definite integral can be used to define a new 
function. Suppose that fis a given function which is continuous at every x 
in some interval J. Let a be an arbitrary number in J. A new function F is 
defined, for every number ¢ in J, by the equation 


F(t) = | FiO ax. 


The existence of £ f(x) dx follows from the continuity of f and the inte- 
grability Theorem (5.1). Thus the function F is well defined by the above 
equation. 

Using the interpretation of the integral as area, we can give geometric 
meaning to F(t). Suppose that a < ¢ and that f(x) a 0 for every x in the 
interval [a, ¢], as shown in Figure 15(a). The integral s f(x) dx is then equal 
to the area of the region P bounded by the graph of f, the x-axis, and the lines 
x = aand x = ¢t. Thus 


F@) = area’). 
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On the other hand, if t < a and f(x) > 0 for every x in [f, a], which is the 
situation shown in Figure 15(b), then the area of P is equal to the integral 


y-axis y-axis 


t 
F()= f f(x)dx 


t 
F(t)= f f(x)dx 
= area ({P) “i 


= — area (P) 


Figure 15 


| i F(x) dx. Hence, in this case, we have 


FQ): = | f(x) dx = - | f(x) dx = —area(P). 


In the general case, of course, f may take on both positive and negative values. 
If the region bounded by the graph of f, the x-axis, and the lines x = a and 
x = tis expressed as the union of the part Pt above the axis and the part P— 
below the axis, then 


F(t) = +[area(P*) — area(P7)], 
where we take + or — according asa < tort <a. 


We come now to the main result of the section. 


(5.2) THE FUNDAMENTAL THEOREM OF CALCULUS. Let f be continuous 


at every x in some interval I, and let a be a number in I. If the function F is 
defined by 


Fi) = | F(x) dx, for every t in I, 


then F is a differentiable function and 
FP’) = f(), for every t in I. 


Proof. According to the definition of the derivative, we must prove that 


lim “eee = f(t). 


h50 
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By the definition of the function F, 


t-+h 
F(t +h) = | FO) ae. 
Hence 
t-th t 
F(t +h) — F(t) = | f(x) dx — | fixydx. 


By Theorem (4.7) we have 


t t+h t+h 
| fix) a+ | faye = | f(x) dx, 


and the preceding two equations therefore imply that 


t-+h 
Fit +h) —-— F(t) = | f(x) dx. 


Consequently, 


t+h & 
ees FO) 1f fCAdx. (1) 


These steps are illustrated geometrically in Figure 16. 


y-axis 


FUt+ h)— EQ) 
t+h 


= f f(a)ax 
t 


Figure 16 


Let the maximum and minimum values of f between ¢ and t + h be 
denoted by fi, and fm, respectively. Thus 


Sm S f(x) S fs 
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for every x between t and ¢ + A. If / is positive (as it is in Figure 16), then it 
follows by Theorem (4.3) that 


t-+h t+h th 
| Int | FHA | Fu ax. 
t t t 


Since f, and fy are constants, Theorem (4.1) implies that 
th 
/ Sm Ax = fn s(t +h — t) = fnoh, 
t 


i-th 
| Iu dx = fu'@+t+A-—1t1) = ful. 


Hence 


t-th 
fons | Jia <a: hh, 


or, equivalently, 


: t-th 
pis | f(x) dx < fi. (2) 


If, on the other hand, / is negative, it is a straightforward (and logically 
necessary) matter to verify that the same inequalities (2) follow. Combining 
(1) and (2), we therefore obtain 


far ES < fy (3) 


Finally, since fis continuous at ¢, we know that 
lim 7, = fp =i: fy. 
h—-0 h—0 
F(t +h) — F(t) 
h 


quantities both of which approach f(7) as A approaches zero. It, too, must 
therefore approach f(t) as a limit. We conclude that 


The fraction is thus seen in (3) to be caught between two 


F'(t) = lim Fey = FW) = f(t), 


h—0 h 


and the proof of the Fundamental Theorem is completed. 


Before reaping the computational rewards of this theorem, we give a 
concrete example to emphasize precisely what the theorem says. 
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t 
l 
EXAMPLE 1. If F(t) =f eh | o% mos 11), Ff 2), and F(x). The 
0 


integrand in this example is the continuous function f defined by 


l 
2332 mee = In this case, therefore, the interval J can be taken to be the 
whole real line. By the Fundamental Theorem, 
| 
/ me ae oak 
F() = fi) = 5 
In particular, 
nae 1 
ee +1. 2 
l | 
/ = DSC > ss 
Bere =5, +1 5 
and, in general, - 
ie ee l 
Fy = oj for every real number x. 


As the preceding example illustrates, the conclusion of the Fundamental 
Theorem can equally well be written 


fix f(x), for every x in J. 


We used the letter ¢ in the statement of the theorem simply to avoid con- 
fusion with the dummy variable x which appears in the integral. We might 
just as well have written 


1 Fi) = | f(t) dt, then F(x) = f(x), 
or, perhaps better yet, 


If F(x) = | f, then F(x) = f(x). 


The important thing to remember is that the derivative of F at any point in 
the given interval is equal to the value of the integrand f at that same point. 
We now consider the implications of the Fundamental Theorem. By an 
antiderivative of a function f is meant any differentiable function F with the 
property that F’(x) = f(x) for every x in the domain of f. Similarly, we shall 


say that a function F is an antiderivative of f on an interval J if F’(x) = f(x) 
3 


; : x, ; ——— 
for every x in J. Thus the function =. is an antiderivative of x? because 
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ae Sera ; 
Of course, = is an antiderivative of x° on any interval we care to name. 


The Fundamental Theorem states that if fis continuous at every point of J, 
then the function F defined by 


x 
oe 
a 
is an antiderivative of f on J. 
If a function f has one antiderivative F, then it has infinitely many 
because, for every constant c, 


(Fo of = Foe a 0 = f. 


Conversely, we have proved that any two functions which have the same 
derivative differ by a constant [see Theorem (5.4), page 114]. Hence, if 
F’ = f, then the set of all antiderivatives of fis the set of all functions F + c 
for every real number c. Combining these facts, we obtain 


(5.3) COROLLARY OF THE FUNDAMENTAL THEOREM. Let f be a function 
which is continuous at every x in some interval I. Then f has an antiderivative 
on I. Furthermore, if F is any antiderivative whatever of f on I, then, for any a 
and b in I, 


b 
f f(x) dx = FU) — FW). 


This theorem is the computation tool which we have been seeking. 
Before giving the proof, which is easy, let us see how it works. 


EXAMPLE 2. Evaluate the definite integrals 


(a) | x’ dx, 
(b) | (5 — x) dx. 


Both integrands are obviously continuous functions. As already observed, 
3 


the function F defined by F(x) = 3 is an antiderivative of x’. Hence, by 


Theorem (5.3) 
2 
| x Ox 
0 


F(2) — F(0) 
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Similarly, we can see by inspection that the function G defined by G(x) = 


2 
bas i se 44, 
5x — a is an antiderivative of 5 — x, since 


d 5 
£ (sx - %) ee Oe 


It therefore follows by Theorem (5.3) that 


| (5 — x) dx 


G(4) — G(1) 


lI 
a i 
Nn 
ss. 
| 
NI, 
is Oi cae 
| 
ra 
Nn 
ma 
are 
Pisciacnaete 


12 — 44 = 74. 


These are the two integrals whose values were computed in Section 3 by 
finding the limits of upper and lower sums. The difference in the magnitude 
of the computations there and here should render unnecessary any comments 
on the significance of the results of the present section. 


Proof of Theorem (5.3). The assertion that fhas an antiderivative on J is verified by 
the Fundamental Theorem. Let G be the antiderivative defined by 


G(3) = | f for every x in I. (4) 
Suppose now that F is an arbitrary antiderivative of fon J. Then 
or er) Fo), for every x in J. 


It follows by Theorem (5.4), page 114, that on the interval J the two functions 
G and F differ by a constant. That is, there exists a real number c such that 


G(x) = F(x) + ¢, for every x in J. 


Substituting x = 5 in equation (4), we obtain 


b 
cw) - | .. 


Moreover, we know that G(a) = ij. f =.0.. Hence 


b b 
[ seo = | . 


and the proof of (5.3) is complete. 


= G(b) — Gia) 
= {F(6) + c] — Fe) + <] 
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The following is an extremely useful notational device. For any real- 
valued function F of one variable, we abbreviate F(b) — F(a) by F(x)|°. If F 
is an antiderivative of the continuous function f on some interval containing 
the numbers a and 6, then we may write the value of the definite integral as 


P b 
[ foo ax = Fe) ; 


The advantage of this notation is that the order of writing is the same as the 
logical order in which the problem is done. Thus one first writes the anti- 
derivative, and then indicates the numbers at which it is to be evaluated. 
As a result, the whole problem can frequently be done in a single line. For 


example, 
2 3 
9 X 
ie 
| x dx = 5 


EXAMPLE 3. Evaluate the definite integral = (” — 3y? + 2) dy. 


2 i 0° 
go 22: 


0 


Note that we get the same answer whether the dummy variable of integration 


is y, x, or anything else. The integral is the function f defined by 
6 


Zs 
6 P 
antiderivative of 3y’ is y’, and an antiderivative of 2 is obviously 2y. Hence 
6 


eo 
6 


f(y) = y’ — 3y? + 2. An antiderivative of y’ is easily seen to be an 


— y® + 2y is an antiderivative of f. We conclude that 


1 


I 6 
fo — 3y’ + 2)dy = (% - »' + 29) 


_({U_fa5 1) 2 eee 
-(F-142-1)-(& (-1)' + 2(-) 


$-14+2)-@+1-2) 
ie 


PROBLEMS 


1. Verify that 


1 x’t! is an antiderivative of x’, if r is any rational number 
r 


except —1. 
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2. Find an antiderivative of each of the following functions. 


(a) f(x) = x" (d) f() = 5° +30 +1 

6) fw=X+5 (2) g(x) = Gx + 1) 
2a eS 

©) fo) = ON Oe eerie 


3. Evaluate each of the following definite integrals by finding an antiderivative and 
using Theorem (5.3). 


1 1 
(a) [ae ae (i) [ o-v+ne 


1 0 
(b) / (4x° + 3x° +2x+1)dx (j) / (x° — 9x” + 16x) dx 
0 6 


3 5 
(c) 1 (5x — 1) ax (k) | (2x — 1) dx 
1 3S 
3 ac 
(d) / (St = tas (1) / (6¢ — at + 2)at 
1 3 
2 t 
(e) 7 6 — ) dx (m) / (x? + 3x — 1)dx 
1 x 0 
S Pe 
(f) i x? dx (n) | sds 
3 0 
0 b 
(g) ' y dy (0) | dx 
2 ax 
(h) / (2 + = + 2) dx (p) | (4t — 1) dt. 
1 Xx x x 


4. Let n be a positive integer. 
(a) Evaluate f° x" dx. 


(b) Evaluate | has provided (i) n ¥ 1, and (ii) a and bd are either both 
positive or ‘both negative. 
(c) In (b), what is the reason for proviso (i)? For proviso (ii)? 
5. Let F be an antiderivative of f and G an antiderivative of g. 
(a) Prove that F + G is an antiderivative of f+ g. 


(b) For any constant k, prove that KF is an antiderivative of kf. 
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t 
6. What is the domain of the function F defined by F(t) = | : dx? 
oe 
7. Let F(t) = f (6x? — 4x + 1) dx. 


(a) Using just the Fundamental Theorem and without evaluating F, find 
F'(t), F’(—1), F’(2), and F'(x). 


(b) Find F(f) as a polynomial in ¢ by finding a polynomial which is an anti- 
derivative of 6x? — 4x + 1. 


(c) Differentiate the answer in (b), and thereby check (a). 


8. Let G(x) = | (: a *) di,tor x > 0. 
1 


(a) Using just the Fundamental Theorem, find G’(x) and G’(2). 
(b) Evaluate G(x) as a rational function of x by finding an antiderivative of 
+8 
(c) Take the derivative of G(x) as found in (b) and thereby check (a). 
9. (a) Evaluate F(t) = fi (Gx? + 1) dx. 
(b) Find F’(#) and F’(2) by taking the derivative of the answer to (a). 


(c) Find F’(#) directly using just the Fundamental Theorem and the Chain 
Rule. 


10. If f is continuous and g is differentiable and if F() = f f(x) dx, use the 
Fundamental Theorem and the Chain Rule to show that F’(t) = f(g(n) )g’(d. 

11. In each of the following integrals evaluate F’(t). Do not attempt to first find an 
antiderivative. 


(a) F(t)-= | /1 + x3 dx. 
0 


1 


2t+1 ' 
(c) F(t) = | yn La 


{2 


12. Is there anything wrong with the computation 


jee tane. 


1 


=(-1)- (lI) =- 


If so, what? 
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13. In each of the following, find the area of the subset P of the xy-plane bounded 
by the curve y = f(x), the x-axis, and the lines x = a and x = b. Sketch the 
curve and the subset P. 


(a) fix) = 2 4 1.a@=— <1, and db = 2. 
(b) fG) = + 2c 2 = 0 and} =°2. 
(c) f(x) = $x + 1,4 = 2, and db = 4. 
(d).: JQ) =a pe = Gand 5 = 2. 
(e) f(x) = x* — 2x? + x,a = 0, and d = 2. ; 
14. Find the derivative of the function F defined by F(x) = / 
0 


1 
dt. Sketch 
rt i 
the graph of F using the techniques of curve sketching discussed in Section 1 of 
Chapter 2. Label any maximum, minimum, or critical points and any points of 


inflection. What is the domain of F? (Do not attempt to find an explicit 
1 ; 
r+ I 


6. Indefinite Integrals. In this section we shall study the problem of finding 
a function given its derivative. The topic is a large one, and the present 
treatment is only an introduction. Many techniques for finding a function 
whose derivative is known have been developed, and some of these will be 
studied in Chapter 7. 

Recall that an antiderivative of a function fis any differentiable function 
F with the property that F’(x) = f(x) for every x in the domain of f. An 
antiderivative of f is also called an indefinite integral of f and is denoted 
by 1f@ dx. If F’ = f, we write 


antiderivative of 


| f(x) dx = F(x) + ¢. 


Since the most we know about J f(x) dx and F(x) is that they have the same 
derivative f(x), they may very well differ by a nonzero constant. If the con- 
stant c is omitted, there is a very real possibility of making an error, since a 
particular indefinite integral may not be the one which is the solution to the 
problem at hand. 


EXAMPLE |. At every point (x, f(x)) on the graph of a given function /f, 
there is a tangent line with slope equal to x”. If the graph passes through the 
point (3, 2), find f. The solution is based on the fact that the slope of the 


tangent line is given by the derivative. Hence f’(x) = x?. One function with 
3 


; i . oe. 
this derivative is < and so 


f)=yte, (1) 
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for some constant c. We also write 


3 
[#ac- te 


Since the point (3, 2) lies on the graph, we know that f(3) = 2. Thus 


3 
2=fG@)=F+e=Ite, 
whence c = —7, and we conclude that 
x? 
i) = a Z; 


Omission of the c in equation (1) would have lead to the incorrect answer 
3 


FQ) = > 


The reason for calling an antiderivative of a function f an indefinite 
integral and for denoting it by f f(x) dx is its close connection with the def- 
inite integral. Let f be continuous on an interval containing a and 6. Since 


d 
- { f(x) dx = f(x), we obtain the formula 


b 
; (2) 


[ 50 dx = | 109 dx 


by applying.Corollary (5.3) of the Fundamental Theorem of Calculus. The 
value of f f(x) dx|° is the same for any two indefinite integrals of f and there 
is therefore no need to include the constant c in applications of equation (2). 
For example, even though 


foxtnav-8 4x4 


for an arbitrary real number c, we may write 


= 6. 


2 
0 


| Qx + Idx = [x + 1)de|- (x" + x) 


The integration techniques that we shall consider are expressed in for- 
mulas for finding indefinite integrals. The first four of these, (6.1), (6.2), 
(6.3), and (6.4), have already been used in computing definite integrals. We 
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write them down only to make them explicit. They are: 

(6.1) | [f(x) + g(x)] dx = | f(x) dx + | g(x) dx. 

(6.2) | Riixyax = k | F(x) dx, for every constant k. 


(6.3) [ax =x +c. 


r+1 
(6.4) | x ck = aI + c, where r is arational number different from —1. 


Since an indefinite integral is determined only to within an additive 
constant, (6.1) and (6.2) are open to a possible (but unlikely) false interpreta- 
tion. The precise statement of (6.1) is: If F is an indefinite integral of f and if 
G is an indefinite integral of g, then F + G is an indefinite integral of f + g. 
The proof takes one line: 


ee Oe PG = f+ 2g. 


On the other hand, if F, G, and H are three arbitrary indefinite integrals of 
tf, g, and f + g, respectively, we certainly cannot infer that H = F+ G. All 
we know is that H’ = F’+ G’, whence H= F+G+c. Similarly, 
(6.2) should be read: Jf F is an indefinite integral of f, then kF is an indefinite 
integral of kf. The proof: 


(KEY = kKPoee £7. 
The proof of (6.4) is the equation 


d qo : 
dx (5 T c) ga, 
and (6.3) is simply the special case of (6.4) obtained by setting r = 0. Note 


that each of these four integration formulas is the inverse of one of the basic 
rules for differentiation derived in Section 7 of Chapter 1. 


EXAMPLE 2. Evaluate the following three integrals: 


(i) / (2x ae =) dx, 


(ii) [o’ + 2y° + 2y + 1) dy, 


(iii) : fs?" el de. 
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Computation of the indefinite integrals follows directly from (6.1), (6.2), 


(6.3), and (6.4). Thus 
2 3° a - 2 [ xa 


i (2. + =) dx 
x2 
3 Ht 


B ie 
a 
3 x 


Since separately we would write 2{x?dx = 3x*+ c and also 2 ee 
Z 

— —-+ c, one might think that two constants of integration should appear 
% 


in the sum, i.e., that the answer should have been written 


2 2 a 2 
| (2 + 3)de= 5s “rj Rater ee 


Although this last equation is not false, it is unnecessarily complicated and 
also misleading. If F is one indefinite integral of a function, the specification 
of any other requires the specification of one additional number, not two. 
Remember that, for a given F, the set of all functions F + c such that c is an 
arbitrary real number is identical to the set of all functions F + ci + C2 
such that c,; and cy are arbitrary real numbers. The sum of two arbitrary 
constants is still an arbitrary constant. 

To do (ii), one must realize that the sum rule (6.1) implies an analogous 
rule for integrating the sum of three functions, or four, or any finite number. 
We get 


forte +a4 na fra t2/rat2frat fa 


4 3 Z 
- = ait 
eee 2 


I 


ay ay ee ec. 


Finally, to evaluate (iii), we combine the above rules of integration with 
equation (2) to obtain 


5/3 
2/3 Se 2 
| (s += (S545) 


[2(5)"? + 5] — ay” + 1 
= 3(5)7* 4 42, 


5) 


1 
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The Chain Rule provides an extremely useful technique for computing 
integrals. Suppose that F is an antiderivative of f and that g is a differentiable 
function. According to the Chain Rule, 


[F(g)]! = F’(g)g’. 


Since F’ = f, we conclude that 


[F(g)]! = F'(g)e’ = f(g)9’; 


i.e., the composition F(g) is an antiderivative, or indefinite integral, of f(g)g’. 
Thus we have proved: 


(6.5) If F is any indefinite integral of f, then 


| f(g(x))g'(x) dx = F(g(x)) + ¢. 


This formula tells us that we can integrate a function of the form 
f(g(x))g’(x) provided we know how to integrate f(x). 


EXAMPLE 3. Compute {+/x* + x + 1 (3x? + 1)dx. The integrand is the 
product of two functions. The first factor, ~/x? + x + 1, is the composition 
of x* + x + 1 with the square root, and we know how to integrate +/x. 
The second factor is 3x? + 1, which is the derivative of x* + x + 1. Hence 
(6.5) is applicable. We have 


g(x)=x +x+1, 
f(x) = Vx. 


Since 
| vxax = [2 ax = 2,7 4 ©, 


we take F(x) = 2x°/*. According to (6.5), the answer to the problem is 


Mee — er 4 x 1 te. 
That is, 


[veFest Ge 21a = 26° tae eo ee. 
We can check this answer by taking its derivative. We obtain 

d 

fee l= GO x Gx + 2), 


which is the original integrand. 
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EXAMPLE 4. Evaluate {(x? + 1)'x dx. It is possible to do this problem by 
first expanding (x? + 1)’ by the Binomial Theorem, but formula (6.5) makes 
this unnecessary. Again, the integrand is the product of two functions. The 
first is (x? + 1)°, which is the composition f(g(x)) of the two functions 
g(x) = x? + land f(x) = x’. The latter we know how to integrate: 


6 
feat s¢ 


6 
x : a: 
so we take F(x) = 2 The second factor in the integrand is x, which is not 


equal to g’(x) = 2x, but is a constant multiple of it. This is just as good 
because of the general rule {kf (x) i= k{ f(x) dx. In this case, we may write 


[o + 1)°’x dx = | o oy ade 
=} | (x° + 1)°(2x) dx 


ee | f(g(x))g’(x) dx 


= $F(g(x)) + «. 


(x? a i= 


6 
Since F(x) = - , we have F(g(x)) = : and so 
; 1\ | (x? + 1)° 
: (x + 1)xdx = (5) foe +c 
2 6 
= fee a Cc 


The derivative of the indefinite integral should be the function which was 
integrated, i.e., the integrand. Checking, we get 


d i + 1) = s4(x? + 1)'2x = (x + 1)*x. 


To summarize: Formula (6.5) is applicable if the integrand is a product 
of two functions one of which is a composition f(g(x)) and the other of which 
is g’(x) or possibly a constant multiple of g’(x). With a little practice the 
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reader should be able to recognize immediately, for example, that of the three 


integrals 
| V/x?2 + 2 dx, 


[ve + 2x dx, 
| vie 2x de, 


only the middle one can be successfully attacked by this method. 

Formula (6.5) implies an analogous fact about definite integrals. 
Called the Change of Variable Theorem for Definite Integrals, it is the follow- 
ing: 


(6.6) THEOREM. [/f both integrands are continuous functions on their respective 
intervals of integration, then é 


g(b) 


b 
| f (g(x))g’(x) dx = a LOY 


Proof. Since the integrands are continuous, both integrals exist. Let F be an in- 
definite integral of f. By the definition of the definite integral, 


b) 
9(b) = 


“5 f(y) dy = FY) 


= F(g(6)) — F(g(a)). 


g(a) 
By (6.5), 


b 


pa (g(b)) — F(g(a)). 


b 
| f (g(x) g(x) dx = F(g(x)) 
This completes the proof. 


2 


x+ 1 
—ee oee  e 
integrand is continuous on the interval [—2, 2]. Since the minimum value of 
x? + 2x + 2 is 1, which is positive, we know that the denominator is never 
zero, and so the integral is defined. Set g(x) = x? + 2x + 2and f(y) = y 2”. 
Then g’(x) = 2x + 2, g(—2) = 2, and g(2) = 10. Hence 


EXAMPLE 5. Compute dx. We first check that the 


, x+1 naif 2xt2 4 
—2+/x2 + 2x + 2 2J2/x2 + Ix +2 
2 
=4 [ _f(e@))a'a) ax 


10 
= 3 | yo dy. 
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Since [y—"? dy = 2y'/? + c, we obtain 


10 10 
af yay = y= v0 — v3. 


2 


We conclude that 


2 
Ee | = 
4 —_— dx = 10 — V2. 
L AS 


The differential of a function F was defined in Section 6 of Chapter 2, 
and was shown to satisfy the basic equation dF (x) = F’(x)dx. If F’ =f, 
we therefore obtain 


adF (x) = F'(x)dx = firs (3) 


In this section we have expressed the fact that F is an antiderivative of f by 
writing 


| foyax = FO) +6 (4) 


Equations (3) and (4) suggest that we interpret the symbol dx that appears to 
the right of the integral sign not merely as a piece of notation but as a 
differential. With this interpretation, the symbol { becomes a notation for the 
operation which is the inverse of taking differentials. Thus, for any differ- 
entiable function F, we define 


| dF(x) = F(x) +c. (5) 


If f(x) is given and we find F(x) such that dF(x) = f(x) dx, then 


| 40 dx = [ares = F(x)+ c¢. 
If a function is denoted by a variable u, the definition (5) has the simple 
form { du = u+ c. Moreover, in terms of differentials, Theorem (6.5) 


also has the following simple form: 


(6.7) If F’ = f and if u is a differentiable function, then 


[ f0 du = Fu) +c. 
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Proof. lf u = g(x), then du = g’(x) dx. The above equation therefore becomes 


| f (g(x) g(x) dx = F(g(x)) + ¢, 


and this is just (6.5). Another way of proving (6.7) is to start from the 
equation 


dF(u) = F’(u) du, (6) 


[see Theorem (6.1), Chapter 2]. From this follows dF(u) = f(u) du, and so 
Flu) +c= far = ore 


It is worth noting that Theorem (6.5) is simply an inverse statement of 
the Chain Rule. The Chain Rule was also the raison d’étre behind equation 
(6). The differential is a handy device solely because this important theorem 
is true. 


EXAMPLE 6. Evaluate the integrals 
(i) [vss 1 Dae, 


(11) [sc oly as. 
Tie) set uw = 5x + 2.. Then du = 5 dx, and so dx = = du. Hence 
| Vie# Bax = af Vanda = 4 5ul + 0 
ee See ay | ee: 
Similarly, in (ii), let wu = s? — 1. We get du = 2s ds and 
[0° — 1)'" ds = fw du = 454gu"° + 


= gies — 1)" + ¢. 


In each of these examples the reader should verify that the derivative of the 
answer gives back the original integrand. 


Each of the integral formulas (6.1), (6.2), (6.3), and (6.4) can be 
written as a fact about the integral of certain differentials. Let u and v be 
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differentiable functions and c an arbitrary constant. Then 


(6.1’) [a + dv) = | a + iE 


(6.2’) fi a= k | au for every constant k. 


(6.3’) [ae =u-+e. 


r+1 
(6.4) | i da = = i + c, where r is a rational number andr ~ —1. 


+ 
PROBLEMS 
1. Evaluate the following indefinite integrals. 
@ forest na © fax +2 a 
(b) pe Lee Ee (2) | xV/a2? — x2dx 
3x3 
(c) [ee = 20+ ya my f 42 — ee 
Ve+ 4t+5 
w fortno-96 ty | se? + 38° + 5N6 + 20d 


(e) i a dy (j) fis dx. 


2. Among the following integrals identify those that can be evaluated using the 
techniques in this section. Evaluate them. 


(a) | (« a t) ae (g) | Gr 2 DG 4D ae 


w | (vert)an my) (+ D6 +25 - 9a 
: at 
(c) [rottn'a a7 
A : ce 
(d) Ec + 7) dy G) ra? ay 
(e) [werta (k) ee 


wo [tha 
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10. 


. The curve defined by y = f(x) passes through the point (1, 4). In addition, at 


each point (x, f(x)), the slope of the curve is 8x* + 2x. Find f(x). 


. The line tangent to the graph of the differentiable function f at each point 


(x, f(x)) has slope 3x? + 1, and the graph passes through the point (2, 9). 
Find f(x). 


. If f(x) = 12x? + 2 and the graph of y = f(x) passes through (0, —2) with a 


slope of 5, find f(x). 


. Evaluate the following definite integrals. 


1 2 1 
1 2 
(b) | (20° + ndt (g) | /4 — x2 x ax 
—! oa 
1 2 
(c) | be 4 1) x ax (h) | (2|x| + 1) dx 
—A and 
2 if 
(d) Sa ae (i) | 1 ga as ee og ge ee ag 
—14/52 + 25 + 3 0 


as 3 
(e) [ (« ot 1) (2x- 4) dx (j) jes 
1 XxX 


_ If f(x) = 18x + 10 and f’(O) = 2, find f’(x). If, in addition, f(0) = 


find f(x). 


. (a) If g(x) = Vx and g’(1) = 0 and g(0) = v2, find g(x). 


(b) If f’’@ = 6and f’(1) = 8 and f’(0) = 1 and f(1) = 4, find f(. 


. If the slope of the curve y = f(x) is equal to 6 at the point (1, 4) and, more 


generally, equals 6x at (x, f(x)), what is the area bounded by the curve 
y = f(x), the x-axis, and the lines x = 1 and x = 3? 
1 


Sketch the region bounded by the curves y = aie | 
x 


eee and ».=.4: 


Find its area. 


7. Area between Curves. Let f be a function which is integrable over the 
interval [a, b] and whose graph may cross the x-axis a finite number of times 
in the interval. It was shown in Section 4 that the relation between area and 
the definite integral is given by the formula 


b 
| f(x) dx = area(P*) — area(P_ ), ( 1:) 


where P* is the set of all points (x,y) such that a<x<b and 
= y= f(x), and #5 4s-the, set of all; poimts( (x7) such that a <x < 6b 


220 INTEGRATION [cHaAp. 4 


and f(x) < y < 0. If we exclude their boundaries, then either or both of 
the regions Pt and P~ may consist of several pieces. This possibility is 
illustrated in Figure 17, which shows the graph of a continuous function f 


Figure 17 


which crosses the x-axis at the four points a, b, c, and d. For the regions 
shown in the figure, we have 


b d 
area(Pt) = | Sie +/ Fixes, 
area(P™) = — | Fx) he 


Since area(P* U P~) = area(P*) + area(P7), it follows that 


b Cc d 
area(P* U P~) = | f(x) dx — | f(x) dx + / fx) dx. 


Suppose in this example that F is an antiderivative of f on the interval [a, d]; 
i.e:; we have F(x) =f (x) for every Xia, d], “7 aen 


b 

| soyax = FO) - Feo. 

| f(x) dx = F(c) — FO), 
d 

‘ f(x) dx = F(d) — F(c). 


Hence the total area is given by 


[FQ) — FO] a FO> FOLAE | -Fa 
ae F(a) ae 2 ©) 4 DPOF. 


area(P* U P7-) 
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In this section we shall extend (1) to compute the areas of regions with 
more complicated boundaries. The principal result here is the following: 


(7.1) Let the functions f and g be integrable over a closed interval [a, b], and 
suppose that g(x) < f(x) for every x in [a, b]. If R is the set of all points 
(x, y) such thata < x < band g(x) < y < f(x), then 


b 
area(R) = | [f(x) — g(x)] dx. 


Proof. The region R is illustrated in Figure 18(a). The only complicating feature 
of the proof is the fact that g or f, or both, may take on negative values in the 
interval [a,b]. If this happens [as it does in Figure 18(a)], we shall simply 


y=f*(x) 


y = g*(x) 


(b) 


Figure 18 


slide, or translate, the graphs of g and f upward so that the region between 
them lies above the x-axis and its area is unchanged. The sliding, or transla- 
tion, is shown geometrically in Figure 18(b), and can be accomplished ana- 
lytically as follows. Let / be a lower bound for the values of the function g 
on the interval [a, 4]; i.e., we choose a number / such that / < g(x) for every 
x in [a, b]. Such a number certainly exists since g is assumed to be integrable 
over [a, 5], and is therefore bounded on [a, 5]. In addition, if it does happen 
that g(x) > 0 for every x in [a, b], then we take / = 0. We now define func- 
tions f* and g* by 

f*@) = fe) + (A, 

g* (x) = g(x) + lI. 


Their graphs and the region between them, which is denoted by R*, is shown 
in Figure 18(b). It is obvious that 


area(R) = area(R*). 
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Let P be the set of points bounded by the graph of g*, the x-axis, and the lines 
x = aandx = b. By the integral formula for area, we know that 


b 
area(P U R*) = / f*(x) dx, 


b 
area(P) = / g™*(x) dx. 


Since area(R*) = area(P U R*) — area(P), we obtain 


b b 
area(R) = area(R*) = | fy dx = / g*(x) dx 


b 
j [f*(x) dx — g*(x)] dx. 


However, 
F*(x) — g*@) = [f@) + |!) — [e@) + [I] 
= IL) a 


Hence 


b 
area(R) = / Lf) — pools, 
and the proof is complete. 


It should be remarked that, rigorously speaking, the above proof omits 
some logical details. These occur in our use of the function area. To begin 
with, we have not in this book attempted to give a mathematical definition of 
the area of a set, although we have shown that if area does exist and satisfies 
certain simple properties, then the integral formula (1) is valid. Moreover, in 
the preceding paragraph we have tacitly assumed some of the properties of 
area which are obvious to the intuition, but logically would require proof. 
For example, we assumed that area is invariant under translation when we 
asserted that area(R) = area(R*). In the same way, our statement that 
area(R*) = area(P U R*) — area(P) [see Figure 18(b)] was based solely on 
geometric intuition. There is nothing wrong with omitting these details, but 
it is important that a careful reader realize that the omissions are there. 


EXAMPLE |. Compute the area of the region R bounded by the graphs of the 
functions f(x) = 2x — 1 and g(x) = —(2x — 1) and the lines x = 1 and 
x = 2. Since 


g(x) < 0, for every x, 
JOP Seen = ae = 7, 
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it follows that g(x) < f(x) on the interval [1, 2]. Both functions are con- 
tinuous and hence integrable, and so (7.1) is applicable. We obtain 


area(R) = | [f(x) — g(x)] dx. 


Since f(x) — g(x) = (2x — 1) + (2x — 1) = 4x’ — 2x, the answer is 
2 


2 
area(R) = | (4 = Ix\dx = (2° — x’) 
a 1 


= (2 — 4)— 4-1) = 42. 


EXAMPLE 2. Find the area of the region R lying between the lines x = —1 
and x = 2 and between the curves y = x”? and y = x*. We first sketch the 


y-axis 


R=R,UR, 


l zZ x-axis 


Figure 19 


lines and curves in question and indicate the region R by shading (see Fig- 
ure 19). Observe that 


fd ee Ed, 


x < x, 
Se i ae eee 
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Writing R as the union of regions R; and Ro, as is done in the figure, we have, 
by (7.1), 


if 
area(R,) = | (x = ee 
—l 


area(R2) = | (if =e ae 


Hence 


area(R) = area(R,) + area(R:2) 


ig (Of ge a = | QS a ex. 


is clearly an indefinite integral of x? — x’. 


| 
aa a 


Thus 


1 


= eS ee, 


| (x” — x*) dx = F(x) ; 


ae if a =a ia 


2 


= +F(Q) + FAY, 


1 


re 
Pen 
a 
| 
oe 
eng 
> 
| 


— F(x) 


Finally, therefore, 
area(R) = FC) — F(—1) — FQ) + Fd) 


1 v6 4 ces eae 
73 ‘t ge 3-t te = 7. 


Because of symmetry, each of the integral formulas for area which we 
have developed has an obvious counterpart for functions of y. For example, 
if fis the function of y whose graph is drawn in Figure 20, then 


d 
| f(y) dy = area(P*) — area(P_), 


where P* and P~ are the regions indicated in the figure. 

Sometimes a curve in the xy-plane can be defined as the graph of a 
function of y and not as a function of x. An example is the parabola defined 
by the equation (y — 1)’ = x — 1 and illustrated in Figure 21. Although 
this equation cannot be solved uniquely for y in terms of x, it is easy to do the 
Opposite. We get 


x=“ bf SPS yeas, 
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and so the curve is the graph of the function f defined by f(y) = y? — 2y + 2. 
The area of the region P bounded by the parabola, the two coordinate axes, 
and the horizontal line y = 2 is given by 


area(P) = | fy) dy = i (y — 2y + 2) dy 


3 
ae: 6 Sa. 


y-axis 


2 


ee. 


Figure 20 


ree rt) 


y=2 


Figure 21 


EXAMPLE 3. Express the area of the shaded region Q in Figure 22(a) as a 
sum of definite integrals. There are many different ways to do this. We shall 
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begin by subdividing Q into four subregions Q,, Qo, Q3, and O, using the two 
vertical lines x = a, and x = az as shown in Figure 22(b). 

Consider for a moment the region Q, alone. Every horizontal line L 
which intersects Q, cuts its boundary in at most two points. Suppose that L 
crosses the y-axis at the point (0, y). Moving along L from left to right, we 
denote the x-coordinate of the first point encountered on the boundary of 
Q, by gi(y). In this way a function g; is defined whose graph forms the 
“western boundary” of Q,. In fact, Q; is the region between y = b, and 


(c) 


Figure 22 


y = by [see Figure 22(b)] and also between the curves x = gi(y) and x = ay. 
By the counterpart of (7.1) for functions of y, we therefore obtain 


bo 
area(Q,) = : [a1 — gi(y)] dy. 


Turning next to the region Qs, we see that the “northern boundary” 
is the graph of a function f;, and the ‘“‘southern boundary” is the graph of a 
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function fo. These functions are indicated in Figure 22(c). Hence 


area(Q2) = | [fi(x) — fa(x)] dx. 


Similarly, we have 
area(Q3) = | [Ai(x) — ho(x)] dx, 


and also 
area(Q4) = | [go(y) — a2] dy. 


Since area(Q) = area(Q,) + area(Q:) + area(Q3) + area(Q,), we there- 
fore obtain finally 


bo 


area(Q) = d [a1 — gi(y)] dy + | [fi(x) — fo(x)] dx 


a | [Ai(x) — ho(x)] dx + | [g2(y) — a] dy. 


PROBLEMS 


1. For each of the following functions and intervals: Compute { : f(x) dx; draw 
the graph of f; label the region Pt on or above the x-axis which is bounded by 
the curve y = f(x), the x-axis, and the lines x = a and x = 5; label the analo- 
gous region P~ on or below the x-axis; evaluate area(P*) and area(P—); and 
check formula (1) at the beginning of the section. 


(a) f@®) =x- a= Oanddi= 4, 

(b) f(x) = —x? + %x+ 2,a = 0,and db = 3. 
(c) f(x) = —-x? + x+2,a = —2,and bd = 3. 
id) Jin = @=— Da -),.and 4 = 2. 


2. In each of the following find the area of the subset P* U P~ of the xy-plane 
bounded by the curve y = f(x), the x-axis, and the lines x = a and x = BD. 


(a) f(x) = x*°,a = —1,and 5 = 1. 
(h). fy = < — 3x + 2,0 — 0. and 6b = 2. 
(c) f(x) = @& + I& — I)&@ — 3), a = 0, and b = 2. 
@ f@) = | = 1,e@ = —2, and b = 2. 
3. Let f be a continuous function. Using areas, show that 
(a) If fis an odd function, then f*, f(x) dx = 0. 
(b) If fis an even function, then { = fide = 2F : f(x) dx. 
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10. 


it, 


iz: 


13. 
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. Prove (a) and (b) of Problem 3 analytically using the Fundamental Theorem of 


Calculus. [More specifically, use Theorems (5.2) and (5.3).] 


. Draw the region R bounded by the lines x = Oand x = 2 and lying between the 


graphs of the functions f(x) = x + 2 and g(x) = (x — 1)’. Find the area 
of R. | 


. Draw the region Q bounded by the curves x = 0, 3y — x + 3 = O, and 


3y + 3x? — 8x = 3. Compute area(Q). 


_ Find the area of the subset R of the xy-plane lying between the lines x = 4 


and x = 2, and between the graphs of the functions f(x) = = and g(x) = x’. 
eC 


Draw the relevant lines and curves and indicate the region R. 


. Find the area of the region bounded by the two parabolas y = —x? + x + 2 


and y = x? — 2x. 


. Draw the graphs of the equations y = x? and y = 4, and label the region R 


bounded by them. 


(a) Express the area of R as an integral with respect to x using (7.1). Evaluate 
the integral. 


(b) Similarly, express the area of R as an integral with respect to y using 
the counterpart of (7.1) for functions of y. Evaluate the integral and 
check the answer to (a). 


(a) Iff(y) = —y? + y + 2, sketch the region bounded by the curve x = f(y), 
the y-axis, and the lines y = 0 and y = 1. Find its area. 


(b) Find the area bounded by the curve x = —y” + y + 2 and the y-axis. 


(c) The equation x + y? = 4 can be solved for x as a function of y, or for y 
as plus or minus a function of x. Sketch the region in the first quadrant 
bounded by the curve x + y? = 4, and find its area first by integrating a 
function of y and then by integrating a function of x. 


The function fis continuous at every point of the interval [a, b] and may cross 
the x-axis at a finite number of points in the interval. Let P+ and P~ have their 
usual meaning [as in formula (1)]. 


(a) Is |f(x)| continuous at every point of [a, 5]? 


(b) Show that 
b 
area(P* UP) = | | f(x)| dx. 


Find the area of the region bounded by the parabola y = x’, the x-axis, and the 
line tangent to the parabola at the point (2, 4). Do the problem 


(a) using x as the variable of integration. 
(b) using y as the variable of integration. 


Do Problem 12 for the line tangent to the parabola at the general point (a, a’). 
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2 2 
14. Express the area of the ellipse ~ + = = 1 as a definite integral of a function 
x 2 


of x, and as a definite integral of a function of y. (The resulting indefinite inte- 
grals cannot be evaluated with the theory so far developed.) 

15. Find the area of the shaded region in Figure 23. The curves are parabolas. The 
inscribed square has area 4, and the circumscribed square has area 16. 


Figure 23 


8. Integrals of Velocity and Acceleration. In this section we shall develop 
some of the integral formulas associated with velocity and acceleration. 
Among these is the formula for the distance traveled by an object, or particle, 
which moves with velocity v(t) during the time interval from ¢ = atot = b. 

Consider a particle which, during some interval of time, moves along a 
straight line. We take the straight line to be a coordinate axis of real num- 
bers, and denote the position of the particle on the line at time ¢ by s(t). 
Thus s is a real-valued function of a real variable with value s(t) for every tf 
in some interval. We shall assume that s is differentiable. In Section 3 of 
Chapter 2 the velocity v of the particle is defined to be the derivative of s. 
That is, 


v(t) = s'(0). 


Equivalent to this equation is the statement that s is an antiderivative of v. 
It follows that 


sf) = [ow dt +c. (1) 


If we consider the motion of the particle from tf = a to t = b, and add the 
assumption that v is continuous on [a, b], then, by Corollary (5.3) of the 
Fundamental Theorem of Calculus (see page 204), we obtain the formula 


b 
s(b) — s(a) = | v(t) dt. (29 
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EXAMPLE |. An object dropped from a cliff at time t = 0 falls with a velocity 
given by v(t) = kt. If we take the direction of increasing distance to be down- 
ward, and measure distance in feet and time in seconds, then k = 32 and v(f) 
is in units of feet per second. How high is the cliff if the object hits the bottom 
3 seconds after being dropped? The height equals the difference s(3) — s(0). 
Hence 


s(3) — s(O) = | v(t) dt 


3 


5) 
| 32t dt = 16f° 
0 


144 feet. 


0 


Acceleration is defined in Section 3 of Chapter 2 to be the derivative of 
velocity. Thus 
a(t) = v'(t), 


and, as before, an equivalent statement is that velocity is an antiderivative 
of acceleration. We therefore have the formula 


v(t) = [aw dt +c. (3%) 


EXAMPLE 2. A body in free fall under the earth’s gravitational pull falls with 
a constant acceleration g, equal in magnitude to 32 feet per second per 
second. Suppose that at time t = 0 a ball is projected straight up from the 
ground with.an initial velocity v) = 256 feet per second. Write formulas for 
the subsequent velocity v(t) and distance from the ground s(t). What is the 
maximum height the ball attains? In this example we shall choose the direc- 
tion of increasing distance to be upward. As a result, the gravitational 
acceleration is negative, and the starting point of our calculations is the 
equation a(t) = —32. We have 


bt); = | acta +c= [esa +c 
—32t+c¢, 


whence 
v(0) = —32°0+ c= Cc. 


The initial velocity is given as Up = v(O) = 256 feet per second. Hence 
c= 256 and 
u(t) = —32t + 256, 
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which is one of the formulas asked for. The second integration yields 


sr) = [ow dt +c 


fe + 256)dt+c 
<b 2 5et- <-. ¢. 


The constant of integration c in the preceding equations has, of course, 
nothing to do with the one obtained from integrating a(t). Here s(0) = 
—16-0> + 256:0+ c=. Since the ball is at ground level when ¢ = 0, 
we conclude that 0 = s(0) = c, and so 


s(t) = —162f? + 2561, 


which is the second formula required. To find the maximum value of the 
function s, we compute its derivative and set it equal to zero: 


s(t) = v(t) = —32t + 256 = 0, 
whence it follows that 


t = +38 = 8 seconds. 


Since s’’(t) = v(t) = a(t) = —32, which is negative, we know that s has a 
local maximum when ft = 8, and it is easy to see that this local maximum is 
an absolute maximum. Thus the maximum height attained by the ball is 
equal to 


—16-8? + 256: 8 
-_ cnn 18 ae Qu 
= 1024 feet. 


5(8) 


It is important to realize that the quantity s(b) — s(a) in (2) does not 
necessarily equal the distance traveled by the object, or particle, during the 
time interval from t = ato t = b. This is because the number s(f) simply 
gives the position of the particle on the line at time ¢. Thus in the preceding 
example of the ball we showed that 


s(t) = —162f? + 256¢. 
Substituting t = 0 and ¢ = 16, respectively, we get 


a0) = in oe OO 
s(16) = =16 * 16 + 256-16 = 0. 
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The interpretation of these equations is clear: The ball left the ground at 
time ¢ = 0, and 16 seconds later it had fallen back. However, an insect who 
accompanied the ball on its flight would probably not report to his admiring 
friends and relatives that the total distance traveled was 


s(16) — s(0) = 0 — O = O feet. 


A similar situation is an automobile trip 10 miles down a road and back 
again. The distance traveled is presumably 20 miles, and not zero. 

It is not hard to guess the proper modification of formula (2) to obtain a 
true distance formula. If we denote the distance traveled during the time 
interval from t = ato t = b, where a < b, by distance|°, then 


; b 
distance| = | |v(t)| dt. (4) 


a 


Logically, however, there is no way to prove (4), since we have not given 
a mathematical definition of distance|?. As a result, we shall take (4) as a 
definition after checking that it corresponds to our intuition. First of all, if 
u(t) does not change sign from t = ato t = Bb, then 


b b 
| |u(t)| dt = f v(t) dt 


(See Problem 4 at the end of this section.) Hence, by formula (2), 


b b : 
| |v(t)| dt = f v(t) a = |s(b) — s(a). 


The assumption that v(t) does not change sign means that the direction of the 
motion does not change. In this case, we would certainly expect |s(b) — s(a)|, 
which equals the distance on the real line between the initial position s(a) 
and the final position s(b), to be the total distance traveled. 

Further motivation for the definition in formula (4) is obtained by going 
back to the definition of the definite integral. We assume that the function 
[v(t)| is integrable over the interval [a,b]. Consider an arbitrary partition 
CO. = {hts ntelay Ola; 8) such that 


=i ih oo * So 


For eachi = 1,...,, we denote by M; the least upper bound of the set of 
all numbers |v(t)|, where f is in the subinterval [t;_1, t,]. Similarly, let m, 
be the greatest lower bound. Thus the number M; is the maximum speed of 
the particle in the subinterval [t;_1, ¢;], and m; is the minimum speed. In- 
tuitively, therefore, the distance traveled by the particle during the sub- 
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interval of time [¢;_1, t;] must be less than or equal to M,(t; — t;_1) and 
greater than or equal to m,(t; — t;_1). Consequently, the upper and lower 
sums, 


= » M(t; — ti_-1), 


qt 
LS > mG; — 14), 
1=1 


are upper and lower bounds, respectively, of the total distance traveled. But 
the function |v(t)| has been assumed to be integrable over [a, b]. It follows 


b 
that there exists one and only one number, { _ (U(t)| dt, such that 


b 
Lee | |u(t)| dt < U,, 


a 


for every partition o of [a, b]. This justifies the adoption of formula (4) as the 
definition of distance|?. 


EXAMPLE 3. A particle moves on the x-axis, and its position at time ¢ is given 
by x(t) = 28 — 212? + 60 — 14. Its velocity is the function v defined by 


v(t) = x(t) = 6f — 42t + 60 
= 6(? — 7t+ 10) = 6(¢ — 2)(t — 5). 
Find the total distance traveled by the particle during the time interval 


{= Otot= 6. Clearly, v(t) = 0 when ¢ = 2 and ¢t = 5, and the sign of 
u(t) is as indicated by 


+ +4 ----- + + + + sign of v(t) 
Hence 
v(t) if—o <t< 2, 
lu(t)| = 4 —v(2) oe PLD, 
v(t) 1 oe Ex oe. 
Consequently, 


distance 


| |v(t)| dt 


9 ad 


= [ ema | 


v(t) dt + | v(t) dt. 
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Since x(t) is an antiderivative of v(t), 


| v(t) dt = x(2) — x(0), 


| v(t) dt 


= (5) ~ x12), 
| v(t) dt = x(6) — x(5), 
and so 
| |v(t)| dt = —x(O) + 2- x) — 2+ x(5) + x(6). 
Substitution in the equation for x(t) yields x(0) = —14, x(2) = 38, 


x(5) = 11, and: x(6) = 22. Pence 


6 6 

= | ear 

0 0 

144+ 2-38 —2-11+ 22 


= 90. 


distance 


The motion described in this example is the same as in Example 1, page 
105. By looking at Figure 16, Chapter 2, one can see that the distance traveled 
by the particle during the time interval [0, 6] agrees with the value just ob- 
tained. 


The integral formulas derived in this section all presuppose motion along 
a straight line. The reason for this restriction is that the definition of velocity, 
and consequently of acceleration, has been based on the possibility of re- 
presenting the position of the particle at time ¢ by a real number s(t) along a 
coordinate axis. A coordinate system on a line in turn is defined in terms of 
the distance between two points, and thus far the only measure of distance 
between points which we have is straight-line distance. In Chapter 10 we 
shall introduce the notion of arc length along a curve and shall study the 
notions of velocity and acceleration for curvilinear motion. At this point, 
however, it is worth noting that if we think of obtaining a curve by bending a 
coordinate axis without stretching it, and if s(t) measures position on the 
curve in the obvious way, then formulas (1), (2), (3), and (4) still hold. Thus 
if the speedometer reading on a car is given by some nonnegative and inte- 
grable function f of time, then the distance traveled during the time interval 
[a, b] is equal to (fa dt whether the road is straight or not. 

We conclude this section with another type of problem illustrating the 
integration of rates of change. 
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EXAMPLE 4. Air is escaping from a spherical balloon so that its radius r is 
decreasing at the rate of 2 inches per minute. Find the rate of change of the 
volume V as a function of time if we are given that r = 10 inches when ¢ = 0. 
What is the volume of the balloon when t = 2 minutes, and at what time will 
V = 0? The volume of a sphere is given by V = 4rr°, and we are given that 


dr 
= ae 
Hence 
, = [fate= [inate 
dt 
= —2t+c. 


Setting ¢ = 0, we see that the constant of integration c is the value of the 
radius at t = 0, namely, 10 inches. Hence 


r= —2t+ 10. i 
Since 
ES a ee 
BPRS Ops se dt 
we obtain for the answer to the first part of the problem 
sid = 4r(—2t + 10)°(—2) = —32e(—7t + 5)’, 


: oe dV . oaks 
where ¢ is measured in minutes and a in cubic inches per minute. Clearly, 


V(t) = Sn[r(QP = 4n(—2t + 10)°. 
The volume of the balloon when t = 2 minutes is therefore 
V(2) = $n(—2:-2 + 10)? = 416° = 2882 cubic inches, 
and the volume will be zero when 
V(t) = $n(—2t + 10)° = 0, 


Lc., when f= 5 minutes. 


PROBLEMS 


1. A straight highway connects towns A and B. A car starts at t = 0 from A 
and goes toward B with a velocity given by v(t) = 60¢ — 12°, measured in 
miles per hour. When the car arrives at B, it is slowing down and its speed is 
48 miles per hour. 


(a) How far apart are the two towns? 


(b) What are the maximum and minimum speeds obtained during the trip? 
When are they reached, and at what distances? 
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. A straight highway connects towns A and B. A car, initially stopped, starts at 


t = 0 from A and accelerates at 240 miles per hour per hour until reaching a 
speed of 60 miles per hour. 


(a) How long does this take, both in time and distance? 

Assume that the car travels at the constant speed of 60 miles per hour once it 
has reached that speed, and that it slows down to a stop at town B in the same 
way that it left A. 


(b) How far apart are A and B if the whole trip takes 5 hours? 


. A projectile is fired straight up with an initial velocity of 640 feet per second 


(see Example 2). 
(a) Find the velocity v(f). 
(b) How far does the projectile travel during the first 10 seconds of its flight? 


(c) How far does the projectile go, and how many seconds after takeoff is this 
maximum height reached ? 


(d) What is the total distance traveled by the projectile during the first 30 
seconds of its flight? 


(e) What is its velocity when the projectile returns to the ground? 


. Let the function f be integrable over the interval [a, 5], and suppose that f(x) 


does not change sign on the interval. Prove that 


b b 
| If@)| dx = | f(x) dx| - 


(This is an easy problem. Consider separately the two cases: First, f(x) > 0 
for €évery x in [a, 5], and second, f(x): <-Otor every x in [a, 5]. 


. A particle moves on the x-axis with velocity given by v(t) = —4t + 20. 


(a) In which direction is the particle moving at time t = 0? 


(b) Find»s(t), the position of the particle at time f¢, if its coordinate is —30 
when ¢ = 1. 


(c) Find the distance traveled by the particle during the time interval from 


t= Otor = 4. 
(d) Find the distance traveled by the particle during the time interval from 
t= Otot = 8. 


(¢) When is-s(f);= 07 


. A particle moves on the y-axis with acceleration given by a(t) = 6t — 2. 


Denote its velocity and position at time ¢ by v(f) and y(t), respectively. At time 
t = 1, the particle is at rest at the zero position. 


(a) Find v(t) and y(?). 


(b) How far does the particle move during the time interval from t = 1 to 
t = 3? 


(c) What is the distance traveled by the particle from t = —1 tor = 2? 
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10. 


11. 


. A road borders a rectangular forest, and a car is driven around it. The car 


starts from rest at one corner and accelerates at 120 miles per hour per hour 
until it reaches the next corner 15 minutes later. The second side is 20 miles 
long and the car is driven along it at constant velocity equal to the final velocity 
reached on the first side. The car continues at this same speed on the third side. 
On the fourth side, however, the car slows down with a constant acceleration 
and comes to a stop at its original starting place. Find 


(a) the dimensions of the rectangle. 
(b) the acceleration on the fourth side. 


(c) the time taken for the whole trip. 


. Let the function f be integrable over the interval [a, 5]. From the definition of 


integrability in Section 1, prove that J . fis the only number such that 


b 
het: fs U,, 


for every partition o of [a, bd]. 


. A conical funnel of height 36 inches and base with radius 12 inches is initially 


filled with sand. At t = 0, the sand starts running out the bottom (apex of the 
cone) so that the volume V of sand remaining in the funnel is decreasing at the 
constant rate of 10 cubic inches per minute. 


(a) Find V asa function of time ¢, and determine how long it takes for all the 
sand to run out. 


(b) Assuming that the sand retains its original conical shape during the 
process, find the radius r of the base of the cone of sand as a function of f. 


A particle moves on the parabola y = x’, and its horizontal component of 


velocity is given by x’(t) = t > 0. At time ¢ = 0 the particle is at 


5 ey 
(¢+ 1)?" 
the origin. 


(a) What are the x and y coordinates of the particle when ¢ = 1? When 
f= a7 


(b) As ¢ increases without bound what happens to the particle? 


At time t = 0, an object is dropped from an airplane which is moving horizon- 
tally with velocity vy. As the object falls, its horizontal component of velocity 
x’(f) is constant and equal to vp. Its downward acceleration y’’(t) is constant 
and equal to —g. Measure the positive direction of x in the direction of motion 
of the airplane and the positive direction of y upward. Also assume that 
(0) = yO) = 0. 


(a) Find x(t) and y(%). 


(b) By eliminating ¢ from the equations in (a), find the equation in terms of x 
and y in which the object falls. What is the name of the curve? 


(rad ER le 


Logarithms and 
Exponential Functions 


1. The Natural Logarithm. If fis any real-valued function of a real variable 
which is continuous on some interval, we have seen that there exists an anti- 
derivative, or indefinite integral, F such that F’(x) = f(x) for every x in the 
interval. For some functions we have been able to write their antiderivatives 
explicitly. For example, if f(x) = x” and r is a rational number different 
from —1, then the general antiderivative is 


|x or 
a 


where c is an arbitrary constant. This formula is not applicable if r = —1. 


Ly :; 
Nevertheless, — is certainly continuous on the interval (0, «), and therefore 
x 


some function has it for a derivative. For every positive number a, we can 
define such a function by writing 


Pigy = | a, for every x > 0. 


For then by the Fundamental Theorem of Calculus [see Theorem (5.2), 
page 200], ' 


x 


dt ee 
To be specific, we choose a = | and select | - for a particular antideriy- 
1 


, I POC Ta 
ative of —. As we shall see, this function is interesting enough to have a 
x 


Special name and a special notation. The notation we shall use is In(x), or, 
more briefly, In x, and we shall save the name and reason for its choice until 
we have investigated its properties. For now, we define 


rine fest for every x > 0. 
1 


238 


sEC.1] THE NATURAL LOGARITHM } 239 


The reason that x must be positive in this definition is that the function 


has a discontinuity at = 0. If x is negative, the integral does not exist and 
In x is not defined. 
Geometrically, In x is an area or the negative of an area. If x > 1, then 


l 
In x is the area of the region bounded by the hyperbola y = > the f-axis, 


and the lines t = 1 and t¢ = x [see Figure 1(a)]. On the other hand, if 
Q < x < 1, we have 

Be at: 

giagose _ 


ux a 


and In x is the negative of the area shown in Figure 1(b). Thus we have the 
following properties of the function In. 


heer So: - 
(1.1 ) ln x >, eS ee 
nx =o, 2 Bae oe ee 


] x t-axis x 1 t-axis 
(a) (dD) 


Figure | 


Another interesting property is obtained by taking the derivative 


d 
re In kx, where k is an arbitrary positive constant. By the Chain Rule, 
a 


d te l l 
‘ues ato ia aes 


Hence the functions In kx and In x have the same derivative. We know that 
two functions which have the same derivative over an interval, in this case 
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all positive real numbers, differ by a constant. Hence 
In kx — in x = , 


for some real number c and all positive x. We evaluate c by substituting a 
particular value for x. Since the value of In 1 is known, we let x = 1, getting 


lnk &— int =e. 


Since In 1 = 0, we know that c = Ink. Hence Inkx — Inx = Ink, or, 


equivalently, Inkx = Ink + Inx 


In deriving this equation we have used the fact that k is a constant and x a 
variable. Once we have derived it, and know that it is valid for every positive 
k and x, we can forget the distinction and write 


(1.2) Inab=Ina-+In8B, for all positive real numbers a and b. 


ae ‘4 a ee J 
If a is positive, then — is also positive and a substitution of — for b in 
a a 


1 

(1.2) gives the equation In 1 = Ina + In—. Since In 1 = O, we have 
a 

(1.3 ) In- = — Ina, a> %. 


1 
Applying (1.2) to the product of the two numbers a and B? and using 


l l 
(1.3), we obtain Ins = In (« 5) =Ina+ Ing = Ina ~ In db. That 1s, 


(1.4) In? =Ina—Inb, a> Oandb > 0. 

In summary: The function In applied to a product is equal to the sum of 
the values obtained when the function is applied to the factors. Applied to 
the quotient, the value of In is the difference between the values of the func- 
tion applied to the numerator and to the denominator. 


ee 
Let r be any rational number, and consider the derivative — In x’. 
By the Chain Rule again, o 
Han i ae oe = Aoi = ae 
dx MEX x = 
Moreover, 
ae ae = oe = 2s 
dx aS :) a” 
Thus In x” and rln x have the same derivative and so must differ by a con- 
stant: 
lexi vn ws & 
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Substitution of 1 for x tells us that c = 0, and therefore In x” = rInx for 
every positive number x. For uniformity in appearance with properties 
(1.2), (1.3), and (1.4), we set x = a and obtain 


(1.5 ) Ina” = rina, for every rational number r and 
every positive real number a. 


The properties we have derived for the function In should be recognized 
by anyone who has been exposed to logarithms. However, In is not the same 
as the function log:) or the function logs, which the student may have seen 
earlier. It has many of the same properties though and is called the natural 
logarithm, hence the abbreviation In. 


d l 
We next draw the graph of the function In. Since 2 ix = = and-simce 
I a fie Ps 
—is positive for all x for which In x is defined, the slope of the tangent line to 
x 


the graph is always positive. From this we can see that In x always increases 
as x increases; that is, the function |n is strictly increasing. To prove this 
analytically, we use the Mean Value Theorem, page 113. Suppose that 
0 <a < b. By the Mean Value Theorem there exists a number c such that 
amc <0 and 


Inb—Ina= (6 —- a) (4 In x) (Cc) = = a) oo, 


Hence Ina < In b, and the monotonicity of the natural logarithm is proved. 
2 


d ft | 
Since He In x = — —, the graph of In x is concave downward for all x. 
x a 


Figure 2 


With this information and the fact that In | = 0, we can make a reasonable 
sketch of the graph. This appears in Figure 2. Note that the graph is steep 
and the values of the function negative for very small values of x. The values 
increase and the curve goes up to the right, passing through (1, 0) with a 
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d | Jiisag 
slope of 1. Since ae In x = —, the slope decreases as x increases. This raises 
Ix x 


the question of whether In x becomes arbitrarily large as x increases without 
bound, or possibly tends toward some limiting value. However, it is obvious 


ie aera 4 ; 
that the inequality = = 71s true for every real number in the closed interval 


[1,2]. It follows from one of the fundamental properties of the integral 
[specifically, from (4.3) on page 191] that 


- =f 
ae = dt. 
fdas fda 


The left side of this inequality is equal to In 2, and the right side to 5. Thus 
we have proved that 


ln-O2ie 1 F 
[a fact which can also be obtained geometrically by looking at Figure 1(a) 
1 : 
and considering the area under the curve yp = p between ¢ = 1 and t = 2]. 
Hence for every rational number r, we have In 2” = rIn2 > : . By taking r 


large enough, we can make é and, consequently In 2”, as large as we like. We 
conclude that 


hel x oe, 


TO 


EXAMPLE |. If In 2 = aand In3 = 5, evaluate each of the following in terms 
of a and b. 


(a) 1n4, (c) Ind, (e) Ing, 
thy. le (d) In 24, (f) In 23", 


Using the various properties of logarithms developed above, we find 


(a) In4 = In 2? = 2inZ = Za, 
(b) In6é = In2:3) = In2+In3=a-4+ 5b, 
(c) Ing = —In6 = —(a+ b) = —-a-b, 


(d) In 24 = In(8- 3) = In 8 + In 3 = In 2? + In 3 = 3 In2 + In 3 
= 3a+ Jb, 


(ce) Iné = In2 — In9 = In2 — In 3? = In2 — 21n3 = a — 28, 
(foo In 273? = 2” 4 In 3” = min? +n ins aa 
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The derivative of the natural logarithm of a differentiable function is 
found by means of = Chain Rule. Thus if F(x) = In(f(x)), then F'(x) = 


in’ (Oi) = ——- = a 


a differentiable eo of x, then the same result can be written 


If the variable u is used to denote 


EXAMPLE 2. Find the derivatives of (a) In(x? — 3), (b) In\V/4x + 7. Using 
the Chain Rule in (a), we obtain 


de 2 a 
yo he eG : 
Note that the original function, In(x? — 3), is defined only for |x| > V3, 


although the function which is its derivative can be defined for all x except 
+ /3. We can do (b) either by use of the Chain Rule directly, as 


d i ek ee —1/2 . Geant f 
7invaxt T= a, (4x + 7) ae oe ey, 


or more simply by noting that In./4x + 7 = 41n(4x + 7). Then 


5s 2 eee 


a InV/4x + 7 = SS digas oo i= a Ax 5 a7 : 


dx s+ 


The latter method would be much shorter for finding the derivative of 
In(Qx? + 2)(x — 3)(x + 5)3. The expanded form In(x? + 2) + In(x — 3) + 
3 In(x + 5) is certainly simpler to differentiate. 


Since the natural logarithm is defined only for positive numbers, it 
; Nie I 
follows that the function In x is an antiderivative of — only for x > 0. It is 
I , ee 
natural to ask whether or not — , which is defined and continuous for all x 
x 


except 0, has an antiderivative for x < 0. The answer is yes; In(— x) is an 
antiderivative. Of course, if x is negative, then — x is positive and so In(— x) 
is defined. By the Chain Rule, 


pa 2 


d 
a eas a) 
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! Bit Su I 
This equation may be combined with the equation a In x = —, which holds 
for all positive x, into the single equation . gs 


@ inilx| = = > for all x except 0. 


dx 


: 
x 


The corresponding formula for the indefinite integral is 


(1.6 ) 


If fis a differentiable function, the Chain Rule implies that 


d a ee ae se 
gn PO ya a 


Hence, we have the integration formula 


(1.7) iY dx = In| f(x)| + c. 
dx x+7 
EXAMPLE 3. Integrate (a) | i. (b) | Pea egal dx. To do (a), we use 


(1.6) and the fact that the integral of a constant times a function is the con- 
stant times the integral of the function. 


ax ol fee 

[B= 3] S-ympi+e 
For (b), formula (1.7) is applicable because the numerator is $ times the 
derivative of the denominator. 


oo k+tT 1 | See ot sia 
jottl«-| | SS eae ae ae aS) ie 


Alternative ways of writing the integration formula (1.7) are obtained 
by letting u = f(x). We then get 


| cae = Inju| +c. 


u dx 


d 
Using the theory of differentials, we have du = Pk and the formula 
becomes . 


du 


— = Inju| +c. 
u 
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2 2 


a d 
EXAMPLE 4. Compute the definite integral | pee 


Led ios, 
He ee etu = x ly 


Then da = 3x¢x, or, equivalently, x°dx = a Hence 


x dx _1du_1\ du, 
Ss Sake Tea eal a 
and so 
2 
x ax T+ du | 
iz | = Line + 6 
= sln|x*— 17| +. 
Finally, therefore, 
x” dx l 3 : l l 
1 l 1 9 
= 3 ind 3 inl? = 3 Inia 


Note that, if we had neglected the absolute value, we would have encountered 
the undefined quantities In(—9) and In(— 17). 


Since logio10 = 1 and log,2 = 1, it is reasonable to ask for that value 
of x for which In x = 1 and to call this number the base for natural logarithms. 
We know that such a number exists, since In x increases as x increases and 
since the graph of y = In x crosses the line y = 1. An accurate graph would 
show that the abscissa of the point where they cross is between 2.7 and 2.8, 
more accurately between 2.71 and 2.72, and more accurately still between 
2.718 and 2.719. Although this number is irrational, we can find better and 
better decimal approximations to it. They start out 2.71828.... This num- 
ber, denoted by e, is therefore defined by the equation Ine = 1. 


l 1 l 
The sequence of rational numbers 1 + 1, 1 + 5? 1+ 3? 1+ geri 


| 
Per See: approaches 1 as a limit. The chord connecting points (1, 0) 


l l 
and (i +-—, In (1 + )) on the graph of y = In x has a slope equal to 
n n 


n(1 +4) — | 
A = nin(1 +4). 
1 n 
(a= 
n 
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Ey l 
Hence, by (1.5), the slope is In (1 + -) . Asn— o, we have —- — 0, and 
n 


the chord approaches the tangent to the curve at (1, 0), which has slope equal 


to 1. Hence 
lim In (: - ‘) a ae 
n> nh 


Furthermore, as can be seen from Figure 3, the slope of the chord increases 


y-axis ae 


TABLE -1 


1 a3 2 X-axis 


fae 2.2. Sao 


Figure 3 


2c , 
as n increases. Hence In (1 + —]} increases, and because In is an increasing 
n 


n 
function, it follows that the numbers (1 _ *) also increase. Since 
n 


in(i +4) <= ins, 


(1 +2) aie 


It follows immediately from the least upper bound property of the real num- 
bers that a bounded sequence of increasing real numbers must have a limit. 


Hence 
lim ( + ‘) exists. 


Since In is a continuous function, the limit of a natural logarithm approaches 
the natural logarithm of the limit, and so 


| = im in(1 “+ 1) = In| lim ( +3) | 
N20 n n—0 fh 


we know that 
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Since 1 = Ine and since there is only one number whose logarithm is 1, we 


conclude that 


l n 
In Table | values of (1 + ‘) for several values of n are compared with the 


limiting value e. 


PROBLEMS 


1. Find the derivative with respect to x of each of the following functions. 


(a) In x” 
(b) In(7x + 2) 
(c) Inv/(x — 3)(x + 4) 


(d) In(x” — 9x + 3) 


(e) In 
x 


(f) Inw7x3 


a Ss 
f 
(g) | 7 


(In x)” 
In(in x) 
In xv/ et 


2. Show that |x? + 2x + 3] = x? + 2x +3 and hence that In(@x* + 2x + 3) 


is defined for all x. 


3. If In2 = p, In3 = q, and In5 = +, write each of the following as a function 


of p,g, andr. 
(a) In10 

(b) In 0.25 
(c) In 6000 


4. Integrate each of the following. 


(6) | = - 


(d) 
(€) 
(f) 


(c) 


(d) 


In 0.625 
In 0.03 
In 1728. 
& — 3)dx 
x? — 6x + 2 
[ v2 +3dx 
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13x° — 2 > 
x3 — 6 = a (2x — pp 
Ps 3 “ 1 
. [(Gatts - sera) 7 i 


xX dx 
(g) i 


5. Sketch the graph of each of the following functions. Label all extreme points 
and points of inflection, and give the values of x at which these occur. Classify 
each extreme point as a local maximum or minimum. 


(a) f(x) = In + 4) (d) f(x) = x° — 4Inx’ 
(b) f(x) =Inx’ (ec) f(x) =x° + 4Inx’. 
(c) f(x) = Ind + x) 


6. Compute the following definite integrals. 


1 Be 
dt t at 
(a) ee (d) gee 
3 q cee 
x t 
(b) i se te) i 2 
2 <2 
5s x 
(c) ie — 4s + fat oe ©) L. x 


7. In each of the following examples, find the area of the region bounded by the 
graph of y = f(x), the x-axis, and the two vertical lines whose equations are 


given. 

(a) f(x) = fox = Band x =7. 
ih) - fix = MX, x = Land x = 4, 
(c) foo = Sx =9andx = 11 


Sixt = | fae , what is the domain of the function F? 
t as 
2 


x 


—2 


9. (a) For what values of x is | iar defined ? 


(b) What, if anything, is wrong with the computation 


2 
| ee = In|x — 1 
ar ae | 


2 
= In1l — In3 = —In3? 


—2 
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10. Use the appropriate form of L’H6pital’s Rule to evaluate each of the following 


limits. 
. Sax : In x 
(a) lm — (d) lim 
rao xX ae & | ee 
= hex : 3 
(b) lim —-> (e)*. im x in x, 
T— xm xz—>0+ 
n an arbitrary positive integer n an arbitrary positive integer. 


(c) lim xInx 
z—0+ 


2. The Exponential Function. According to the definition in Chapter 1, 
page 13, a function f is a set of ordered pairs with the property that the first 
member x of any pair (x, y) in the set determines the second member y, 
which we call f(x). For example, the set {(1, 3), (2, 5), (4, 3)} is a function. 
Suppose, for a given function f, we consider the set of all pairs (y, x) such that 
(x, y)is inf. This set may or may not be a function. In our example it is the 
set {(3, 1), (5, 2), (3, 4), which is not a function because it contains both 
(3, 1) and (3,4). Hence the first member does not determine the second 
uniquely. However, if the new set is a function, it is called the inverse 
function of f and is denoted by f*. 

If f is the function defined by f(x) = 7x, for every real number x, then 
the inverse function f—? exists and is defined by f-'(x) = 4x. On the other 
hand, the function f defined by 


f(x) = *, a a ee Oy 


does not have an inverse. The reason is that if f(x) = x? = 4, for example, 
we have no way of knowing whether x = 2 or x = —2. If we restrict the 
domain of f to the set of nonnegative real numbers, 1.e., 


oie) 28, Ox 
then f_* exists and is the function defined by 
[ues aes ee ee 


The following three elementary properties of functions and their inverses 
should be noted: 


(2.1) a 2. = 

C2)» = () if ond only at x = f(y). 

(2.3) Two functions f and g are inverses of each other if and only if 
f(g(x)) = x, — for every x in the domain of g, 
g(f(x)) = x, for every x in the domain of f. 
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The first of these, (f-')* = f, follows at once from the definition of f—. 

To prove (2.2), suppose first that y = f(x). This means that the ordered 
pair (x, y) belongs to the set f. Hence the pair (y, x) belongs tof‘. But this 
says that x = f-'(y). To prove the converse, suppose that x = f—1(y). 
Hence, by what we have just proved, we know that y = (f—')"‘(x). Since 
(jy =f, we Obtain = fix). 

The importance of (2.3) is that it provides a simple criterion for deciding 
when two functions are inverses of each other. For example, if f(x) = 


= 
3x + 5 and g(x) = a then 


feo) = (252) = 3 (FF) +5 => 


(esp S 
: = 


g(f(x)) = gBx+ 5) = 


and we may therefore conclude that g = f-' and that g + = f. The proof is 
completely straightforward (and rather tedious), and we omit it. 

The only way a function f can fail to have an inverse is if there exist at 
least two elements a and 6 in its domain for which a # 6 and f(a) = f(b). 
Suppose that f is a strictly increasing function and that a and 6 are two 
distinct numbers in its domain. Then either a < borb <a. Ifa < b, then 
f(a) < f(b); and if b < a, then f(b) < f(a). Thus it is impossible that 
f(a) = f(b), and we conclude that f exists. A similar argument applies to 
any function that is strictly decreasing, and so 


(2.4) If f is strictly increasing or strictly decreasing, then f has an inverse. 


The natural logarithm has been shown to be a strictly increasing function, 
and hence must have an inverse. Let us call this function exp(x) and justify 
the name after we have looked at its properties. Thus, by (2.2), 


y = ex) if and only if x = In y. 


Since In y is defined only for positive y, we see immediately that exp(x) 
is always positive. Furthermore, for any real number x, there exists a number 
y such that x = In y because the graph of the equation x = In y crosses every 
vertical line. Hence exp(x) is defined for every real number x. Finally, 
since 0 = In 1, we obtain | = exp(0). Summarizing, we have 


Expl xy 0, <o = 


eS exp(0) = 1. 


To develop the algebraic properties of the function exp, let p = exp(a) 
and q = exp(b). Then a = Inp and b = Ing. Hence 


a+ b= Inp-+ Ing = Inpag, 
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and therefore pg = exp(a + b). Replacing p and q in this last equation, we 
obtain the important fact that 


(2.6 ) exp(a)- exp(b) = exp(a + Db). 


l l 
Similarly, —a = —Inp = Ber , and RereEare 5, = exp(—a). Replacing 


p in this equation, we find that 


(2.7 ) exp(a) = exp(—a). 
If we apply (2.6) to the sum of the two numbers a and —), we get 


exp(a — b) = exp[a + (—))] = exp(a)- exp(—D). 


Since 
a : 
it follows that 
exp(a) _ = 
(2.8 ) tts a exp(a — b). 


If a function f has an inverse, then the ordered pairs which comprise it | 
become the ordered pairs of f~' when we interchange the order of each pair. 
Hence the graph of f~' may be obtained from the graph of f by interchanging 


y = exp(x) 


x-axis 
Figure 4 


x and y. This is equivalent to a reflection across the line y = x. The graph of 
y = exp(x) is thus the reflection of the graph of y = In x across this line, 
and it is shown in Figure 4. The curve passes through (0, 1) and gets closer 
and closer to the x-axis as x decreases without bound. As x increases without 
bound, so also does exp(x). 
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The graph of y = exp(x) is a smooth curve, and it is obvious geometri- 
cally that there is a tangent line at every point. We conclude that exp is a 
differentiable function. [For an analytic proof of this fact, see Theorem 
(3.4) in the next section.] We may compute the derivative by implicit differ- 
entiation. Consider the equation y = exp(x) and its equivalent equation 
In y = x. The latter implicitly defines exp(x) since In[exp(x)] = x. Hence 
from In y = x we obtain 


2, oor 
deans <a 
s9 
de 
dy 
dx 7 


Replacing y by exp(x), we get 


(2.9 ) 


Alternatively we may write: If f(x) = exp(x), then f’(x) = exp(x) for every 
real number x. 

Thus exp(x) is a most remarkable function, one which is equal to its own 
derivative. It is quite easy to show that this function and constant multiples 
of it are the only functions with this property. [See Problem 8 at the end of 
this section. ] 

Another property of exp arises as a consequence of the logarithmic 
equation, In p’ = rlnp, for positive p and rational r. If we again let 
a= np, then p= exp(o)<and or =rinp = nz’. Equivalent to 
ar = |n p’ is p” = exp(ar). Since p stood for exp(a), we conclude that 


(2.10 ) [exp(a)]” = exp(ar), —_ for all a and rational r. 


We know that exp(0) = 1 and may wonder what exp(1) is. If we set 
y = exp(l1), then 1 = In y. But eis the only number with a natural logarithm 
équal to 1. Hence, exp(1) = <. 

As an application of (2.10), we see that if x is a rational number, then 


e” = texp(lyTr = expt l* 2} = expGy. 


What about e* if x is real but not rational? Note that if a is any positive 
number, we have previously encountered a raised only to rational powers. 
For example, at this point we have no idea what 3” even means. For the 
number e, however, there is a very natural way to define e* for all real 
numbers x. We have just shown that if x is rational, then e” = exp(x). 
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Since the function exp has every real number in its domain, we shall define 
e* to be exp(x) if x is not rational. Hence 


e* = exp(x), _—_ for every real number x. 


We shall define a”, for an arbitrary positive number a and real x, in Section 
4, and then 3” will make sense. 

The reason for the term “‘exp,”’ which is an abbreviation for “‘exponen- 
tial,’ should now be clear. The function agrees with our previous idea of an 
exponential for rational values, and it has the following properties translated 
from those derived earlier in this section: 


Girceut, 
b b 
fog =e". e 
eek. (1) 
ee 
a 
.—— fee 
ob 


(2) 


If u is a differentiable function of x, then the Chain Rule implies that 


fea (fe)& Sinc Sy ape = have 
ee ee eee 
oi Sea ae 
ge on © Aix GP, 


EXAMPLE 1. Find the derivative of each of the composite functions 


(ayhe? t= vb 2c -—- ©) 


e2x3 


d 
For the first, we have — et? = ett (2x £7) = 2e"*") “For {b), 
dx dx 


d 2 2 d vA 
— e*” = e* — x? = 2xe*, For (c), we write — as e°**” and differentiate 
dx dx ° e 


6x 
to get e~?*"(— 6x”) or — pas 
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Since e” is its own derivative, it is also its own indefinite integral. Hence 


(2.11 ) 


More generally, from (3) [or, equivalently, from (6.5), page 213] we 
obtain the integral formula 


| Oe BSS, 
(723 fe nox = ° + ¢, 


EXAMPLE 2. Compute the following integrals: 


br 2 o847 et dx 
(a) fe dx, (b) [xe dx, (c) i — 


du 
To solve (a), we let u = 5x. Then = 5, and multiplying by 2, we have 
2 ay = | | agape io | uv du 
[- dx : eddx = % Ca 


d 
In the same manner we solve (b) by letting vu = x3 + 7. Then ee bo 
x 


I 
| 


Omitting the explicit substitution of the variable u, we write 


3 3 3 
: [ve TT dx = sfe V3 x ide ede ie 


Part (c) combines logarithms and exponentials. Since e” lacks only a factor 
of 4 to be the derivative of 4e” — 3, we can supply the 4, and the integral is 


1 du 
then of the form [- = dx, which is equal to In|u| + c. Hence 
u dx 


e dx l l - = 2 
year: ih dx = % In|4e —3/+c. 

Each of these answers can be checked by differentiating to see if we get back 
the original integrand. For example, in (b) we get 


aie E 


<q ee 


e347 2 23417 
ait gan 2 teh 


_ %, 
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In (c), 
I 2 
iim waE = 


hee 3. 


= x In|4e” — 3] +c) = 


Since the natural logarithm and the exponential functions are inverses 
of each other, an application of (2.3) gives the two useful formulas 


EXAMPLE 3. Simplify (a) e?!"* and (b) e***™*, For the first, since 
2 In x = In x’, we have 
e2inzx ae gina? ae 


For part (b), 


eitiing — pipsinz _ ereinz® — 93,5. 
PROBLEMS 


1. One must be careful to distinguish between the inverse of a function and the 
reciprocal of the function. If f(x) = 3x + 4, write 


(a) f(x), the value of the inverse function f~' at x. 
(b) [f(x)]~}, the reciprocal of f(x). 


2. If fhas an inverse f—!, what relations hold between the domains and ranges of 
Pa: eos 
3. Find the derivative with respect to x of each of the following functions. 


(a) = 
i= 
(b) 4e CS 
co x (i) 4 
() e™ g = 
Xx 
(ce) e*Inx ao 
(f) e +e (1) a. 
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4. Solve each of the following integrals. 


(a) fx dx (f) fase 
2esx 


(b) | 2xe" dx (g) | me 
(c) ~ ca (h) | (x + De” dx 
(d) | rsh - dx (i) | e" dx 


5. Evaluate each of the following integrals. 


2 x 
(a) | a (c) | e dt 
a. oe 
3 8 
(b) | e”” dx (d) | ii 
0 4 € 


6. Sketch the graph of each of the following equations. Label all extreme points 
and points of inflection, and give the values of x at which these occur. Classify 
each extreme point as a local maximum or minimum. 


(a) y= en id) y= xe 
rity 


(by y= cine iC y= 2 


G) ghee 


7. In each of the following examples, find the area of the region above the x-axis, 
below the graph of the function f, and between the two vertical lines whose 
equations are given. 


(a) f(x) = 2e",x =0 and x =2. 
(6) fix) = - ya 2 ee ee 
ic) Fie = a x=3 and “w= 7. 


8. Suppose f is a function which has the property that it is equal to its own 
derivative; i.e., f’ = f. 


x. 


(a) Compute the derivative of the quotient —— 


(b) Using the result of (a), prove that f(x) = ke* for some constant k. 
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9. Let f be the function with domain [0, 1] and defined by 


10. 


M, 


12. 


13, 


14. 


15. 


f(x) = +</2x — x’, eae as 5 


Draw the graph of f and the graph of the inverse function f—. 
Compute each of the following limits using L’HOpital’s Rule or some other 
method if you prefer. 


io. a ———— (e) lim — 
t— >0 X— 00 e 
oA oe ee . ees 
(b) lim (f) lim — 
Fee, x I— 00 xé 
ser = vig 
(ce) lim © = (g) lim a 
z—1 aoa 1 z—0+ e 
25 Be ——_ 
eo. ta —*—. a 
z30] — e xz—0 x 


Show by means of an example that half of (2.3) is not enough. That is, define 
two functions f and g such that f(g(x)) = x, for every x in the domain of g, 
tote ~ f-"'. 

C= 


; 1 meee «| a , 
Prove that lim = |, using i, e* = e* and the definition of the deriv- 
z—0 x x 


ative at 0. 
A function of x is a solution of a differential equation if it and its derivatives 
make the equation true. For what value (or values) of mis y = e”* a solution 
d*y dy 
of — 3—+2y = 0? 
ax dx 
Find each of the following limits. 


n 


n— oO 


2n 
(a) lim (: 4 ‘) 2 n an integer. 


—3zxz 
(b) lim (: oh 1) , x an integer. 


T— 


Let f be a function differentiable on some unbounded interval (a, ©). Prove 
that if lim [fixy) + f'(x)] = L, then lim (() = L. ..[ Aint: -Gonsider the 


— 00 ae) 


quotient 


] 


ef) | 
a 


3. Inverse Function Theorems. In this section we shall prove some basic 
theorems about continuous functions and the inverses of monotonic functions, 
which we have already used in studying the exponential function. These 
theorems are all geometrically obvious. We shall show that they also follow 
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logically from the definitions of continuity and monotonicity using the least 
upper bound property of the real numbers. This shows, as much as any- 
thing, that these definitions say what we want them to say. To put it face- 
tiously, if these theorems could not be proved, we would change the defini- 
tions until they could be. 


(3.1) INTERMEDIATE VALUE THEOREM. Jf f is continuous on the closed 
interval [a, b] and w is any real number such that f(a) < w < f(b), then there 
exists at least one real number c such thata < c < band f(c) = w. 

A similar theorem is obtained if the inequalities f(a) < w < f(b) are 
replaced by f(a) > w > f(6). The proof, in all essentials, is the same. To 
see that (3.1) is geometrically obvious, look at Figure 5. The horizontal line 
y = wmust certainly cut the curve y = f(x) at least once. 


a b x-axis 


Figure 5 


Proof. Consider the subset L of the interval [a, b] that consists of all numbers x in 
[a, b] such that f(x) < w. The set L is not empty because in particular it 
contains a. Since every number in L is less than b, the number b is an upper 
bound for L. It follows by the Least Upper Bound Property of the real num- 
bers (page 7) that L has a least upper bound, which we denote by c. More- 
over, c lies in [a,b]. There are three possibilities: 


Fern wy, 
(ii) f(c) > w, 
(iil) = fic} = w, 
We shall show that (4) and (ii) are, in fact, not possible. Suppose that 
(i) holds. Since f(b) > w, it follows that c < b. Set e = w — f(c), which is 


positive. Since fis continuous at c, there exists a positive number 6 such that 
whenever |x — c| < 6 and x is in [a, 5], then | f(x) — f(c)| < «. Hence, 
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there exist numbers x in [a, 5] larger than c for which | f(x) — f(0)| < «. 
For any such x, 


Hay F).= |fa) ~ flo) < « =w — £0), 


and so f(x) < w, which implies that x belongs to L. Thus there are numbers 
in L which are larger than c, which contradicts the fact that c is an upper 
bound. 

Next suppose that (ii) holds. Since f(a) < w, it follows that a < c. This 
time set « = f(c) — w, which is positive. The continuity of f at c implies the 
existence of a positive number 6 such that if |x — c| < 6 and x is in [a, 5], 
then | f(x) — f(c)| < €. Pick xo in [a, b] such that |xo — c| < 6 and x) < c. 
We contend that xo is also an upper bound for L. For if x is any number 
such that x» < x < c, then |f(x) — f(O| < «. Consequently, 


Or rey ar FO Se = yw 


and so —f(x) < —w, or, equivalently, f(x) > w, which implies that x is 
not in L. This proves the contention that xo is an upper bound for L, which 
contradicts the fact that c is the least upper bound. 

The only remaining possibility is (iii). Hence f(c) = w, and the proof is 
complete. 


This theorem, (3.1), was used in Section 2, where it was asserted that, 
for any real number x, there exists a number y such that x = In y. We have 
previously shown that the natural logarithm takes on arbitrarily large positive 
and negative values. Hence we can “surround” a given number x with values 
of In. That is, there exist numbers a and b for which Ina < x < Inb. The 
existence of a number y such that x = In y now follows immediately from 
(3:9). 

An interval was defined on page 4 to be any subset J of the set of all 
real numbers with the property that, if a and c belong to Janda <b<o, 
then 5 also belongs to J. The following proposition is therefore fully equiv- 
alent to Theorem (3.1). [More precisely, it is equivalent to the conjunction 
of (3.1) and its companion theorem with the inequality f(a) > w > f(d).] 


(3.2) If the domain of a continuous real-valued function is an interval, then so 
is its range. 


The reader should convince himself that (3.1) and (3.2) are equivalent. 
We have already proved that every strictly monotonic function has an 
inverse [see (2.4), page 250]. However, more is needed than simply existence: 


(3.3) If fis a strictly increasing continuous function whose domain is an interval, 
then the same is true of the inverse function f—. 
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A companion theorem is obtained if “increasing” is replaced by 
“decreasing.” 


Proof. There are three things to be proved: (i) f~! is strictly increasing, (ii) the 
domain of f—! is an interval, and (iii) f—! is a continuous function. The first 
is completely straightforward, and we leave it as a problem. The second 
follows at once from (3.2) and the observation that the domain of f~! is 
equal to the range of f. In proving (ili), we shall assume that the interval 
which is the domain of fis neither empty nor consists of a single point. This 
is reasonable, because in these two cases the assumption that fis continuous 
is not particularly meaningful. Let 5 be a number in the domain of f—!, let 
a = f—1(b), and let e be an arbitrary positive number. If ais an endpoint of 
the interval which is the domain of f, the following argument must be modified 
slightly. We shall assume that a is not an endpoint and also that ¢ is sufficiently 
small that both a + e anda — e are in the domain of f/ (see Figure 6). Set 6 


y-axis 


y = f(x), or, equivalently, 
—x=f'(y) 


Figure 6 


equal to the smaller of the two numbers f(a + «) — b and b — f(a — e€). 
Then, if y is any number in the domain of f—! such that |y — b| < 6, we know 
that 

76-2 << fe +e) 


Since f—! is strictly increasing, we have 
a-—e=fI(fa-—9))<f 0) <f'Y¢@+ 9) =at+e 


which implies that —e < f—'(y) —a <e. Since a = f—1(b), the latter 
inequalities are equivalent to 


Jt O).= fae 


which proves that f—! is continuous. This completes the proof. 
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Our final theorem concerns the differentiability of an inverse function. 
It was used in Section 2, where we asserted that the exponential function exp 
is differentiable. 


(3.4) Let f be a strictly monotonic differentiable function whose domain is an 
interval. If b = f(a) and if f’(a) # 0, then f™ is differentiable at b. More- 


over, 


Proof. According to the definition of the derivative, we must show that 
at es 
tp eee ne i bs 5, 
yb ee f'@ 
Lex = {—'y); Then y = s(x) and, of course, a = f(D). ‘Hence 


Oe ek 1 
y—b fe) — f@ feo—f@ 


© ieee © 


Since the reciprocal of a limit is the limit of the reciprocal, we know that 


ee 
7@ | f@- fea -7O—I@ 
ae, oe ewe | 


It is a simple matter to finish the proof provided we know that x approaches 
a as y approaches b. But this is just the assertion that f—! is continuous at BD, 
which we know to be true as a result of Theorem (3.3). Hence 


2S eae Ae 
ie 3 fC) — FO) 
X= 4 
= | 
Sa = fO)- 6) Soe See) 


yb ¥=2 


= (f Y@), 
and the proof is complete. 


We have also used Theorem (3.4) before to establish the differentiability 
of the function g defined by g(x) = x1/", where n is a positive integer and x is 
any positive real number (see page 72). The inverse function f, defined by 
i (x) = x", for every positive real number x, is strictly increasing and has a 
positive derivative at every point in the interval (0, 0). Theorem (3.4) tells 
us at once that g is a differentiable function. 
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PROBLEMS 


1. Prove that if f is a strictly increasing function, then the inverse function f~! is 
also strictly increasing. 

2. Does the assertion in Problem 1 remain true if “‘increasing’”’ is replaced by 
‘“‘decreasing”’ ? 

3. Show by giving an example that a strictly increasing function is not necessarily 
continuous. 

4. Give an example of a differentiable strictly increasing function defined for all 
values of x whose inverse is not differentiable everywhere. 

5. Show that Theorem (3.4) is geometrically obvious. [Hint: The derivative is the 
slope of the tangent line, and the graph of y = f(x) is the same as that of 
x = f"0).] 

6. Supply the details which prove that Theorem (3.2) is equivalent to (3.1) [Le., 
to the conjunction of (3.1) and its companion]. 


4. Other Exponential and Logarithm Functions. If m and n7 are integers 
and n > 0, then 
Sa 


Hence, for any rational number r, the number 2’ is defined. But what is 
2° if x is not rational? More generally, how should a’ be defined for an arbi- 
trary real number x and a positive number a? 

If x is a rational number and a is positive, we have shown that 
x Ina = Ina’, and therefore a” = e”!"*. However, e”!“ is defined for every 
real number x. We shall take advantage of this fact, and, if x is real but not 
rational, we define a® to be e7/™*. Consequently, for every real number x, 
we have 


This function, so defined, has all the familiar properties of an exponential 
function: 


a > ©) —£R Gx <x CK, 


(4.1 ) Gg =a, 


sEC.4] OTHER EXPONENTIAL AND LOGARITHM FUNCTIONS 263 


The proofs follow readily from the properties of the functions In and 


exp. For example, 


1 1] ] 
a=e'™ = e™4 =a, 


f 7 y zina oY Ine Pla ates 


1 
hee ee = En" 


= Qt", 
The derivative of a” is easily computed from its defining equation. Since 


esveyc csineoviminn 8 fis. ss 
see =" @ 7; Xina) = a In a, 


we have the formula 


(4.2 ) 


More generally, if u is a differentiable function of x, the Chain Rule implies 
that 


EXAMPLE 1. Compute the derivative of each of the following functions: 


(a) ta (b) a (c) qr) 


For (a) we get 
d 


for (b), 
oe cae es a ee 2p 2)(2x) 
rg Be 
and for (c), 
£ TOLD SO GG 5 2 a we 2? (In 22 2 
= 2” +*(In 2)’. 


If a = 1, then a® = e*! = e® = 1 for every real number x. Hence 1’ 
is the constant function 1. 
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If a > 1, then the graph of the function a” resembles the graph of e*. 
The slope of the tangent line to the graph is always positive, for if a > 1, 
then Ina > O, and, since a® > 0, we see that 


—f =aqd Wa >. 
x 


This means that a” is a strictly increasing function (see Problem 10 at the 
end of this section). The second derivative is also always positive, since 


2g. Be ome e 2 
—— @ =. Ine} = a Unay > 2, 


Hence the graph is concave upward for all x. Moreover, there are no extreme 
points, critical points, or points of inflection. The graph is drawn in 


I 
Figure 7. It is relatively flat on the left, passes through (- 3 *), (O17), 
a 


y = a*, where a> 1 


Figure 7 


and (1, a), and goes upward to the right. For greater values of a, the graph is 
flatter on the left and steeper on the right. 
If 0 < a < 1, the function a* may be studied by considering it in 


See l Ee ) 
another form, (‘) . Since — > I, the graph of the function (‘) is of the 
a a a 
type described in the preceding paragraph, and the graph of a’, which is 
2 ee 
equal to (‘) , 1S the same curve reflected across the y-axis. It is steep on the 
a 


l 
left, passes through (- l, = , (0, 1), and (1, a), and flattens out as it goes to 
a 


to the right. It is drawn in Figure 8. 
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Every derivative formula has a corresponding integral formula. Since 


Sie. § eG 
Ce ae ee 


3 


the integral formula corresponding to (4.2) is 


(4.3 ) 


As always, the Chain Rule provides a generalization. If u is a differ- 
entiable function of x, then 


udu, _ a 
[a ne = + ¢, 


Ina 


(2) 


y-axis 


y = a“, where 0 <a<l 


] 


Figure 8 
EXAMPLE 2. Compute each of the following indefinite integrals: 


(a) iE dy, (b) | 0" dx, (c) f Lean dx. 
A direct use of (4.3) gives for (a) 


aay = 3 + ¢. 
: d 
mince — x = eS ee 
dx 


I ; 
tegral (b) can be written 5 | 10 ay, 
rie . 
which by (2) is equal to in 10 +c; Hence 
n 


2 
eno oe eS 
| x10 ax = In 10 + ¢, 
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, and therefore that the integral is of 


d l 
For part (c) we note that — In x = — 
dx x 


the form in (2). Thus 


nel — 23D" 
fesn Pigs in?31 + ¢, 


It was proved on page 241 that In a” = rina, for every rational number 
r and every positive real number a. We are now in a position to remove the 
restriction that 7 be rational. Let x be an arbitrary real number. Then 
a® = e*!™@ and so Ina® = In e*™* = Inexp(xIna). Since In and exp are 
inverse functions of each other it follows that In exp(x Ina) = xIna. We 
have therefore proved that 


(4.4) Ina® = x Ina, for every real number x and every positive real number a. 


Another of the well-known laws of exponents now follows easily: 
(4.5) (a*)’ = a™, for all real numbers x and y and every positive real number a. 


Proof. If we let a® = 6, then (a*)” = bY = ev»? Replacing 4 in the last expression, 
we have 


x Ina” 
ay = 


and, using (4.4), 


y Ina” y(x Ina) aylina 
e = _¢ = @ : 


Since et¥™@ = qt’, it follows that (a”)Y = a*¥, and the proof is complete. 


In particular, (e”)” = e*” for all real numbers x and y. 
Let a be any real number, and consider the function f defined for every 
positive real number x by 


fx) = x*. 


Hitherto in this section we have considered the function a*. Now we reverse 
the roles of constant and variable. One of the basic rules of differentiation 
proved in Chapter 1 states that, if a is a rational number, then 


We now remove the restriction that a be rational. Observe first that x* is 
certainly a differentiable function, since it is the composition of differentiable 
functions: 


x” = e!"" = exp(aln x). 
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Knowing this, we use implicit differentiation to compute its derivative. Let 
= X°: Thee ey = ew” = oli xand so 


cae ey 
a ax é 
pag 8° OP 
ydx “x 

dy _ ay 

fe x 


ax® 


; a 
Since y = x“, it follows that Bee = ax*'. Thus we have proved that 
x 


(4.6) x* = ax*"", for any real number a. - 


dx 


The technique of taking logarithms and differentiating implicitly, which 
was used in proving (4.6), can also be used to compute the derivative of a 
positive differentiable function which is raised to a power which is itself a 


d 
differentiable function. For example, to compute Ss owe ety = x”... Then 
x 


ny = In" 20x In x; 
and it follows that 


os yd + Inx) = x*(1 + In x), Os > &). 


This technique is known as logarithmic differentiation and is a basic tool for 
ies | d : 
finding derivatives. We can use it to derive a formula for a u”, where wis a 

x 


positive differentiable function of x and vu is any differentiable function of x. 
Let.y.= w’,,and theacnay =e in-u.. lence 


Ye dv 
® = »(2 + Inu) 
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and finally, therefore, 


ee _ dv 
= == oe ‘Inu as ES) 


We do not suggest that the reader memorize this formula. It is more im- 
portant to be able to use the method of logarithmic differentiation. 


d x ° x ° 

EXAMPLE 3. Find = (x? + 1)°. Letting y = (x? + 1)° and taking natural 
x 

logarithms, we have 
lag ino. by = Ge 


Differentiating, we obtain 


lL dy —— x 2 
y dx Pree +e In(x +1), 
- | Soe +} 
Hence 
d 9 er A eee € 
J tT ee | Pe In +0) 


The function a* is strictly monotonic if a is positive and not equal to 1, 
increasing ifa > | and decreasing if0 < a < 1. Moreover, it has a nonzero 
derivative at every x. It follows by Theorem (3.4), page 261, that a* has a 
differentiable inverse function. Even as the inverse function of e” is the 
natural logarithm, we call the inverse function of a* the logarithm to the base 
a. Hence 


We emphasize that a must be a positive number different from 1 and 
that log,x is defined only for positive values of x. The so-called common 
logarithm, usually denoted by simply log and encountered in the usual tables 
of logarithms, is the logarithm to the base 10. Thus log 100 = logi,100 = 2, 
since 10° = 100. The logarithm to the base a has the same algebraic proper- 
ties as the natural logarithm: 


loge la=-0, 
log,a = 1, 

(4.7 ) logapg = logap + logaq, 
logs (= logap — loged, 


logap” = b logap. 
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The above properties hold for every positive real number a different 
from 1, for all positive real numbers p and g, and for every real number b. 
Each one may be proved by considering the corresponding exponential 
function. Note that since a* and log,x are inverse functions of each other, 


log a =X, for all real x, 


aa" = x, _ for all positive real x. 


For Guampies 11 we ict x — log.) and y = log,g, then we have p = a* and 
G = dane so 


log,pq = log,(a*a”) = log,a™"™ = x+y 
= log,p + logag. 


The other properties are proved in the same way. 

To compute the derivative of log,x, we let y = log,x. The equivalent 
exponential equation is x = a’, from which it follows that In x = Ina’ = 
ylna. By implicit differentiation, therefore, 


d d 
oe (vind) = ix 0 Ls 

i. 

3 


(4.8 ) 


PROBLEMS 


1. Find the derivative with respect to x of each of the following functions. 


Cope cae (el aa 

(b) logi(x’ + 1) (h) logy(x” — 4”) 
(c) logi4*** ae oe 
oo G< 

(e) xa° (k) (’*y. 
oe 


2. If a and 4 are positive numbers not equal to 1, prove that log,b = , : 
Og,a 
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3. Prove that 
ib logax | 


(a) Inx = (In a)(logax) iy ie = 
logue 


4. Integrate each of the following. 


(a) | T” dx (e) | logve’* ° dx 


2n323 +4 - In |2+3| 
(b) [2 dx (f) oor 3 3 dx 
= akon xs 
(c) f inte + ala (g) fes rt 
(d) i : dx 
x x 


5. (a) Differentiate logarithmically y = woe te. 


(b) For what values of x is the differentiation in (a) valid? 


6. If uw is a positive function of x and a is positive but not equal to 1, show that 


In u 
logau ies 
Ina 


7. Differentiate each of the following functions with respect to x. 
(a) Pi (c) Oe ie 
(b) (In x)" (ay tira)’. 
8. (a) Given only that log,1 = 0 and that log.pg = log.p + log.ag, prove that 


1 
loga ( = —log,p. 
p 
(b) Then prove that log, Ae logup — logug. 
q 


9. Prove all the properties listed in (4.7) (see Problem 8). 

10. Using the Mean Value Theorem, page 113, prove that if f’(x) > 0 for all x, 
then fis a strictly increasing function. 

11. Assume that a > 1. 


(a) Using the definition of a*, show that 


lima = 2, 


(b) Using the result of (a), prove that 


lim a = 0. 


t— —O 
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12. 


$3. 


(c) Using (a), show that 


ee: coer 
lim —a = o., 


T—O X 


(d) Using (b), show that 


* See 
li —q = (0. 
P a dx . ‘ 


(e) What do (a), (b), (c), and (d) say geometrically about the graph of the 
function a*? 


Assume that a; > a, > 1. 

(a) Using the definition of a’, show that, if x > 0, then a;* > ao*. 

(b) Using (a), show that, if x < 0, then ay" < ao". 

Evaluation of a limit of the form lim f(x)!“ is not obvious if any one of the 


following three possibilities occurs. 


ay tim (60 = lim gx) =o 


ay dm f(x) Landdtiviets) = ©. 
(i). tan f(x) = and lim g(x) = 0. 


These three types are usually referred to, respectively, as the indeterminate forms 
0°, 1%, and ©°. The standard attack, akin to logarithmic differentiation, is the 
following: Let 


ix) = in ar" = g(x)In f(x) = we 


g(x) 
One then applies L’Hopital’s Rule to the quotient, thereby hopefully discover- 
ing that lim /(x) exists and what its value is. If it does exist, it follows by the 


w~—a 
continuity of the exponential function that 
lim mn 
ra : h(x) 
= lime’. 


tT 


1 


But, since 
g(x) 


gue 43 Pi 


= f(xy, 


we therefore conclude that 
[ lim neh 
mi foyis=-e X* , 
and the problem is solved. 
Apply this method to evaluate the following limits. 
Ls 


(ayes lims od rr@y etiigar Ei code): apa ior ax? 


z—>0-+ ESO x— 0 
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5. Introduction to Differential Equations. For a given differentiable function, 
we are frequently interested in an equation which contains the derivative of 
the function and which is true for every number in the domain of the function. 
These equations arise naturally in physics and in many applied branches of 
mathematics. An example of such an equation is obtained if y is the function 


d 
of x defined by y = 2e**. Since - = 3( 26°") = 239, the equation 


dy = 

dx cs 
holds for this particular function y and all real values of x. For another 
example, let y be the function defined by y = x* — x’. It is easy to verify by 


differentiation and substitution that the equation 


is true for this function and all real values of x. 

The two equations in the preceding paragraph are examples of differ- 
ential equations. They are called first-order differential equations because 
they involve the first derivative of the function but no higher derivatives. In 
each example above we started with a function and then found an equation 
containing its derivative. More commonly we encounter the differential 
equation and then set out to find the function. For example, for what function 
y is the equation 


true for all values of x in the domain of y? If such a function exists, it is called 
a solution to the differential equation. Generally speaking, if a differential 
equation has one solution, it has infinitely many. We may be required to 
find one solution to a given differential equation, or possibly all solutions. 
Let us try to fix the ideas in the above examples by giving a general 
definition. Consider an equation in three variables x, y, and z, which we write 


F(x, 2) = 0. 
Not all the variables need occur in the equation, but at least z must. Sub- 


Soe . 
stituting ss for z, we obtain the equation 
x 


F(xy2) =o, (1) 


which is a first-order differential equation. This equation, however, is merely 


<= d 
a formal statement of equality containing the symbols x, y, and a . As such, 
x 


it is neither true nor false. By a solution to (1) we mean any differentiable 
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function f such that the equation 


F(x, f(x), f'@)) = 0 


is true for every x in the domain of f. 

The reader should realize, of course, that there is nothing sacred about 
the letters x and y which we have thus far used to denote the independent and 
dependent variable, respectively. For example, the differential equation 

dx 


al as 
i e 


t 
has for a solution the function of t defined by x = = 


In this section we shall consider some simple types of first-order differ- 
ential equations and the techniques for solving them. Other first-order differ- 
ential equations and differential equations of higher order will be studied in 
Chapters 6 and 11. 2 

The first type to be studied has already been solved in this book. Let f 
be a given continuous function, and consider the differential equation 


(2) 


A solution is any function y with the property that its derivative is the func- 
tion f. That is, a function is a solution if and only if it is an antiderivative, or 
indefinite integral, of f. Hence, if F’(x) = f(x), we have 


y= | poyde = Fe) + 6 (3) 


where c is an arbitrary constant. Thus solving the differential equation is the 
same thing as finding the indefinite integral. As c ranges over all real numbers, 
we get all antiderivatives and therefore all solutions to the differential equa- 
tion (2). For this reason, (3) is called the general solution to the differential 
equation. 


EXAMPLE |. Find the general solution of each of the following differential 
equations: 


(a) on Oo 4 3 1, 
ae i 
(b) aoe I, 


(c) x 5 = (In x)’. 


274 LOGARITHMS AND EXPONENTIAL FUNCTIONS | [cHaApP. 5 


Solving (a), we obtain 
y= [ox +2x- Idx =x°+x°—x+e. 


Similarly, for (b), 
x= {(e’-—1l)dt=e’'—tt+ec. 
As it stands, (c) is not in the form of (2). However, an equivalent equation is 


= 


| 
= — (In x), and so 
gy 


l 
— Sua 
y= | x) pias 


dx 
bee 82) 


3 


d 
The integral is of the form fe ve ax, Were Ww = Ih x; Netice 


The second type of differential equation which we consider in this 
section arises when we are given two continuous functions f and g and form 
the differential equation 


(4) 


A differential equation of this form is called separable. An equivalent equa- 

ae d 

tion is 80) = f(x), where the variables have been ‘“‘separated” in the 
x 


sense that on the right we have a function of x and on the left a function of y 
and the derivative of y. The differential equation can be readily solved pro- 
vided we can find antiderivatives of f and g. From the latter equation we 
obtain 


| oy) dx = | f(x) dx. (5) 


Suppose that F’(x) = f(x) and that G’(y) = g(y). That is, 
[ 100 ax = FiIxeer ¢, 


| g(y)dy = Gly) + k. 
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because, by the Chain Rule, 


4 1G) + = G0) = a) 2. 


It follows from (5) that G(yv) + k = F(x) +c. This tells us that G(y) and 
F(x) differ by the constant c — k, which for convenience we rename simply 
c. Therefore, we finally obtain the equation 


Gv) = F(x) + 6, (6) 


which implicitly defines any solution y of the original differential equation. 
Conversely, if we differentiate (6) with respect to x, we get (4) again: 


= “IF (x) + 4h 


. 
Q 
—~ 
ww 

| 


G'Q) = = £0), 


py = $603; 


dx 
dy _ f(x), 
as Oy 


Thus, for any value of the constant c, every differentiable function y defined 
implicitly by (6) is a solution. Hence (6) defines the general solution to the 
separable differential equation (4). 


EXAMPLE 2. (a) Find the general solution to the differential equation 


d 
= oe - (b) Find the particular solution whose graph passes through the 


point (2,1). This is a separable differential equation, and “separating 
; : d 

variables’ we replace it by the equivalent form y A = x, y =~ 0. It follows 

that 
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and, integrating both sides, we obtain 


2 


a 
Stes 
em See 


If we multiply by 2, we get y> = x° + 2c. But twice an arbitrary constant is 
still an arbitrary constant, so we replace 2c by simply c. Hence the general 


d is ees 
solution to ss ee is implicitly defined by the equation 


¥ =a 4. ee 
Solving for y explicitly, we obtain 
y=4vVre4+ce 


as the answer to part (a). To find the particular solution that passes through 
(2, 1), we substitute x = 2 and y = 1 in @) to get 1 = 4+ c¢, whence 


c = —3. Since y takes on the value 1, which is positive, we choose the 
positive square root, and the answer to part (b) is therefore the function 
defined by 

y= Vx? — 3. 


dx 
corresponding function F(x, y, Zz) is a polynomial of first degree in y and z, 
ie., if F(x, y,z) = f(y + g(x)z + h(x). Thus among the differential 
equations 


d 
A first-order differential equation F(x y, *) = 01s called linear if the 


1 

7 2 ate 
dx 

dy _ x, 

dx \?y 


the first two are linear and the third is not. The last type of differential equa- 
tion which we study in this section is the simplest linear type, 


(8 ) 


where & is an arbitrary constant. 
Actually every such differential equation is also separable, since it can 


; d k ; x 
be written in the form = — Vy . We treat it as a third type because it is 
| ¥ 


dx 
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linear and because it has many interesting applications. Solving it as a sep- 
arable differential equation, however, we first replace it by the equivalent 


ld 
equation — —e —k. Then 
dx 


Integrating, we obtain 
Injy) = —kx + ¢, 


which defines the general solution implicitly. Since the natural logarithm and 
the exponential are inverse functions, we can solve for |y|, getting 


ly| = gee ae ee % 


yes Glee ye. (9) 


As c takes on all real values, the quantity e° takes on all positive real values. 
Thus e° is simply an arbitrary positive constant, and +e° is therefore an 


ae ; ae: 
arbitrary nonzero constant. The original differential equation - + ky =0 
x 


certainly also has the constant function y = 0 asa solution. From this fact 
and (9) we conclude that the general solution to the linear differential equa- 
tion (8) is 


y=ce™, (10 ) 
where c is an arbitrary constant. 


EXAMPLE 3. Let x be the amount of radium present in a pile at time ¢. Thus 
x is a function of f. It is known that the rate of radioactive decay of the pile 
of radium is proportional to the amount x that remains in the pile. 

(a) Show that the length of time 7 for an amount x to diminish by 


; : ae ‘ 
radioactive decay to an amount 5 is independent of x. The number 7 1s 


called the half-life of radium. It is equal to approximately 1550 years. 

(b) If 0.01 grams of radium is present at t = 0, how much is present 
after 500 years? 

The rate of change of the amount of radium present with respect to 


oe ea os = Pe 
time is given by the derivative he This is positive for growth and negative 


for decay. Since the rate of decay is proportional to the amount present, 
l.e., to x, we know that 
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where k is some positive constant of propertionality. This is the differential 
equation governing the physical process. We have shown that the general 
solution is 


x = ce! eS 
where c is an arbitrary constant. If after an interval of time equal to T the 


x 
amount has dwindled to >? we have 


x = 
= ge 
2 


—kG+T) kt 


Solving this equation for x and using (11), we get x = 2ce ee 
from which it follows that 2e~*? = 1, or 2 = e*?. Hence, In 2 = In(e*”) = 
kT, and we conclude that 

In 2 


f= 


which is independent of x. 


To do part (b), let us denote by x» the amount of radium present at time 
t = 0. Hence from (11) we get 


—k-0 
ag £e =; 
—kt ° In Z . 
and sox = xe ”’. Since k = Fo? we obtain the formula 
—(In 2/7 
= 


which expresses the amount of radium present at time ¢ in terms of the 
original amount at time ¢ = 0 and the half-life of radium. In our problem 
x) = 0.01 grams, t = 500 years, and T = 1550 years. Hence, the answer is x 
grams, where 


x = (0.01 )e~ 5 In? — 0.008 (approximately). 


PROBLEMS 


1. Find the general solution of each of the following differential equations. 


d. 

@) 2 -3y= @) @+dyS =O +0 
dy eas 2 dy Fos. hey 

(b) Bre Se (e) © 


dy = ee See 
(c) sero (f) oe. =. Z; 
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2. For each of the following differential equations, find the particular solution 
whose graph passes through the point indicated. 


i y ; 
(a) —~ £e passing through (1, 1). 
dy é 
ih 2 a 3y, passing through (0, 5). 
dy 3 : 
(c) y = 18x", passing through (2, —9). 
dy = : 
(d) —+-— = 0, passing through (5, 0). 
a. 7 


3. A curve defined by y = f(x) has slope m at every point (x, y) given by m = 2y. 
If the curve passes through the point (0, —1), find f(x). 


4, Find all solutions to the differential equation nll - . Sketch the graphs of 
the different solutions. dx Eg 


5. Find all solutions to the differential equation - ar . Sketch the graphs of 
x y 


the different solutions. 
6. Classify each of the following differential equations as separable, linear, both, 


or neither. 
d ——— 
(a) Inyo == © Vy +1=+xy =0 
ae 3 dx 
d x 1 dx 
(b) XS +y=e (g) -— =3 
x x dt 
dy x 9 
(c) a = 0 ih —-~ = x 
xX y 
dx = dy : ES 
(d) — SF (i) = + 3y = 7x. 
dx 3 
(e) PA — 7x = 0 


; ae 
7. Analternative approach to solving the linear differential equation = +ky =0 
x 


ae d Sa eee d 
is to write it as - = — ky. The latter equation is similar to = = y, which 
x : 


has e* for a solution. With this similarity in mind, it is not hard to guess, and 
then verify, that y = e~** is a solution to the original equation. The problem 
is now to show that every solution is a constant multiple of e~**. Prove this 


fact by assuming that y = f(x) is an arbitrary solution of = + ky = 0 and 


then showing that the derivative of the quotient eS is zero. (See Problem 8, 
page 256.) E 
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8. A radioactive substance has a half-life of 10 hours. What fraction of an 
amount of this substance decays in 15 hours? 

9. If a certain population increases at a rate proportional to the number in the 
population and it doubles in 45 years, in how many years is it multiplied by a 
factor of 3? 

10. Find the constant of proportionality relating a radioactive substance and its 
rate of decay if the substance has a half-life of 16 hours. 


er d 
11. The number y of bacteria in a culture grows ata rate = proportional to the 
: 


number present. If the number doubles in 3 days and there are 10” bacteria 
present at the beginning of the experiment, how many are there after 24 hours? 

12. A toy block lying on the floor is given a kick. The resulting acceleration v’ 
(which is negative) is equal to —5v. If the kick gives it an initial velocity of 6 
feet per second, how many seconds later is the velocity equal to 2 feet per 
second? 

13. A car sliding along a track slows down at a rate proportional to its velocity. 
If it has one-half its initial velocity after 30 seconds, at what fraction of its 
initial velocity is it traveling after 1 minute? 

14. Let aand 5 be constants with a # 0. Show that the differential equation 


Deg 613;) 
dx 


reduces to ° + az = Oif we let z = y — a As a result, find the general 
by al 


solution of (12). 
15. Use the substitution described in Problem 14 to find the particular solution of 


the differential equation . — 2y = 6 which passes through the point (0, 4). 
x 


CHAPTER 6 


Trigonometric Functions 


Although the reader is probably familiar with the geometry of sines and 
cosines, etc., in terms of angles, our definitions will emphasize them as 
functions of a real variable. Thus the stage is set for the development of the 
differential and integral calculus of these important functions. Later in the 
chapter we introduce complex numbers where exponential and trigonometric 
functions are blended in the famous equation e’” = cos x + isin x. Wecon- 
clude with an application to linear differential equations. 


1. Sine and Cosine. In this section we shall define and study the elementary 
properties of two real-valued functions, the sine and the cosine, abbreviated 
sin and cos, respectively. Both functions have as domain the entire set of all 
real numbers, and, as we shall see in Section 2, both are differentiable func- 
tions. 

The definitions will be made in terms of what is called arc length, by 
which is meant the distance from one point on a curve to another measured 
along the curve. This is a new concept, for although we have defined the 
straight-line distance between two points on page 11, we have not yet treated 
distance along a curve. Actually we shall postpone the discussion of arc 
length in general to Section 2 of Chapter 10 because here we need it only for 
distance along a circle, in fact, only along the particular circle C which is the 
graph of the equation x? + y? = | in the xy-plane. However, we shall as- 
sume that the idea of distance along this curve is understood. For example, 
we assume the familiar fact that the arc length of the whole circle C, i.e., 
its circumference, is equal to 27. 

Let ¢ be an arbitrary positive real number. We denote by P(t) the point 
on the circle C whose distance from the fixed point (1,0) along C in the 
counterclockwise direction is equal to ¢. Intuitively, we take a piece of string 
of length ¢, fasten one end at (1,0), and wrap the string counterclockwise 
around C. Then P(f) is the point on the circle to which the other end of the 
string reaches. Next, for every negative number f¢, let P(t) be the point on C 
whose distance from the same fixed point (1, 0) along the curve in the clock- 
wise direction is equal to —+¢. That is, this time we wrap the string in the 
opposite direction. Finally, for t = 0, we set P(O) = (1,0). The definition 
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is illustrated in Figure 1. Thus, to every real number f¢, we have assigned a 
point P(t) which is an ordered pair of real numbers. Note that, because the 
circumference of the circle C is 27, it follows that 


P(t + 2r) = P(0), es 


for every real number f. 


P(O) = (1,0) 


Figure 1 


The cosine and sine are now defined as follows: cos(f) is the x-coordinate 
of P(t), and sin(f) is the y-coordinate. More briefly, we write cos ¢ and sin f. 


Thus 
Per = (cos 1 si 7). 


For example, from Figure 1, in which P(5) is seen to be in the fourth quadrant, 
we may conclude that cos 5 is positive and that sin 5 is negative. It is clear 
geometrically that, if the difference between two real numbers ¢ and a is 
small, then the point P(t) is close to the point P(a) and hence the differences 
between their corresponding coordinates are small. More precisely, both 
cos t — cos a| and |sin ¢ — sin a] can be made arbitrarily small by choosing 
|t — a| sufficiently small. It follows that the cosine and the sine are con- 
tinuous functions. Their common domain is the set of all real numbers. 

For certain values of t which are simple fractions of the total circum- 
ference 27, it is easy to locate the point P(t) on the circle and then to read off 
the coordinates cos ¢t and sin t. For example (see Figure 2), 


P(O) = CI, 0), 
P (5) = (4V2, 4V2), 
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P (5) = (0, 1), 


P(r) ies f=4, 2) 


from which it follows at once that 


cos0 = 1 and sinO = 0, 
cos 7 = sin 7 = 1/2, 
T pees | fe 
cos, = Oand sin = iz 
cos7 = —1 and sing = 0, 
30 - oF s 
cos = 0 and sin = —1|. 


The reader should be thoroughly familiar with all these values—not by sheer 
memory, but from an understanding of P(¢) and its location on the circle C. 


P(= = (0,1) 


Figure 2 


Most of the important properties of the functions cos and sin can be 
expressed in a few equations called trigonometric identities. Some of these 
are obvious from the definition of P(t). To begin with, it follows from (1) that 


(cos(t + 2m), sin(t + 2r)) = P(t + 2r) = P(t) = (cost, sin 2), 
and so 


cos(t + 27) = cost 


CLY) for every real number t. 


sin(t + 27) = sin t 


284 TRIGONOMETRIC FUNCTIONS | [cHaAp. 6 


The property of cos and sin stated in these two equations is expressed in 
words by saying that cos and sin are periodic functions with period 27. That 
is, each time the value of the variable is increased by 27, the value of each 
function is repeated. 

Next, since P(t) lies on the circle defined by x? + y? = 1, the coordinates 
of P(t) must satisfy this equation. Hence 


C42) (cos t)? + (sint)? = 1, ~—_— for every real number t. 


There is a strong tradition for abbreviating (cos t)” and (sin f)” by 
cos”?t and sin"?, respectively, provided n is a positive integer. [However, one 
never writes sin” 't for (sin t)*.] Asa result, (1.2) is usually written 


cos*t + sin?*t = 1. 


The third basic property of cos and sin comes from the relation between 
P(t) = (cos t, sin t) and P(—1t) = (cos(—2), sin(—f)). It is not hard to see 
that the difference between measuring the arc length |¢| in the counter- 
clockwise direction from (1,0) and measuring the same distance in the 
clockwise direction will be only a difference of sign in the y-coordinate of 
PQ). That is, 


P(—t) = (cos(—9), sin(—1)) = (cos ft, —sin £). 
It follows that 


(1.3 ) cos(—f) = cost 
sin(—t) = —sint 


for every real number t. 


Thus the cosine is an even function, and the sine is an odd function (see 
pages 90 and 92). 

If a and b are any two real numbers, what can we say about the relative 
positions of the points P(a), P(b) and P(a + b) on the circle C defined by 
x + y? = 1? It follows from the definition of P(7) that the point P(a + b) 
is obtained by moving from P(O) = (1, 0) a distance |a + b| along C counter- 
clockwise or clockwise according as a + b is positive or negative. However, 
it is important to realize that P(a + 5) can be reached in two steps another 
way: First, move from P(Q) a distance |a| along C, counterclockwise or clock- 
wise according as a is positive or negative. This move will take us to P(a). 
Second, move from P(a) a distance |b| along C, counterclockwise if |b] is 
positive and clockwise if |b| is negative. By examining the different cases— 
a> Oand b > 0, then a < Oand 6 > O, etc.—one can verify that the final 
point reached in these two steps is P(a + b). Note, however, that in the 
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second step we move from P(a) to P(a + 56) in exactly the same way that we 
would move from P(0) to P(b) according to the definition of P(6)—by 
moving along C the same distance and in the same direction. Thus the 
distance moved along the circle from P(a) to P(a + 5) is equal to the distance 
moved from P(O) to P(b). It follows that the straight-line distances are the 
same, too. That is, the straight-line distance between P(a + b) and P(a) is 
equal to the straight-line distance between P(b) and P(O). This important fact 
is illustrated in Figure 3. 


Figure 3 


We can now derive a formula for the cosine of the difference of two 
numbers, cos(c — d), in terms of the cosines and sines of c and d. Leta = d 
and b= c-—d. Thena+ b = ¢, and it follows directly from the conclu- 
sion of the preceding paragraph that the straight-line distance between P(c) 
and P(d) is equal to the straight-line distance between P(c — d) and (1, Q). 
But 

Pic) = (cop vain ©), 


P(d) = (cos d, sind), 
P(c — d) = (cos(c — a), sin(c — d)). 
Hence by the formula for the distance between two points, 


Vv (cosc — cos dy? + (sinc — sinc dy 


Squaring both sides and multiplying out, we get 


cos’c — 2cosccosd + cos’?d + sin?’c — 2sincsind + sin’d 
= cos*(c — d) — 2cos(c — d) + 1+ sin*(c — d). 
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This equation can be greatly simplified by use of the formula cos’f + sin’¢ = 1 
three times. The result is 


2 — 2cosccosd — 2sincsind = 2 — 2cos(c — @), 


from which follows the identity 


(1.4) cos(c — d) = cos c cos d + sin ¢ sin d, 


for all real numbers c and d. 


A similar formula for cos(c + d) can be obtained easily from (1.3) and 
(1.4). We have 


cos(c + d) = cos(e — (—d)) = cos ccos(—d) + sinc sin(—d). 


Since cos(—d) = cos d, and sin(—d) = —sind, we get 


(1.5) cos(c + d) = cosccosd — sinc sind, 


for all real numbers c and d. 


T 


- 1 ee 
Taking c = —o (1.4), we get cos ( = ) = cos 5 COs a+ Sin’, sin d. 


Since cos = = Q- and sin 5 = |, the result is the useiat equanen 
COS (5 — ‘) = sind. This equation implies its mate. If we write it letting 


= : =e then > — d = aand we obtain cos a = sin ( = a). Thus we 


have proved the symmetric pair of identities 


T = 
ae (5 = ) sieges for every real 


number a. 
) = Cosy 


(1.6 ) 


The remaining two identities are the formulas for the sine of the sum and 
difference of two numbers. The first follows easily from (1.4) and (1.6): 


cos (F - @ + 5) = cos((F - ) - ) 
cos (F - a) cos b + sin (5 — ) sin b 


sinacos 5b + cosasin b. 


sin (a + b) 
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Thence, by (1.3), 


sin(a — b) = sin(a + (—d)) 


sin a cos(—b) + cosa sin(—b) 


| 


| 


sin acos b — cosasin b. 


We write these together in the formula 


Bee sin(a + b) = sinacosb + cosa sin b, 


for all real numbers a and b. 


An alternative approach to the trigonometric functions is made with a 
domain of angles instead of real numbers. We shall show that the two ap- 
proaches are in no way contradictory. 

It is assumed that the reader knows what an angle is and what its initial 
side, its terminal side, and its vertex are. An angle a is said to be in standard 
position on a Cartesian grid if it has its vertex at the origin and its initial side 
lies along the positive x-axis. If any point, excluding the vertex, on the ter- 
minal side is chosen, it has an abscissa x and an ordinate y and lies at a 
distance d from the origin. Although each point on the terminal side has its 


; sete a” ; 
x, its y, and its d, it is easy to see that the ratios and ; are independent of the 
q 


choice of the point, i.e., in Figure 4 we have = = - and - = = 


y-axis 


(X14) 


Figure 4 
Since the location of the terminal side depends on the angle, each of these 
ratios is a function of the angle and we define the cosine of a to be : and the 
sine of a to be Ze Since we may choose any point (not the origin) on the 


terminal side, we could simplify the process by choosing the point where 
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d = 1. This, of course, is the point where the terminal side cuts the circle 
with equation x° + y? = 1. Thencos ais the abscissa of that point and sin a 
its ordinate. Hence in many ways the two approaches are the same. 

We have not yet mentioned units for measuring angles. If we pick a unit 
to agree with the arc length, we would have exact agreement: The cosine of 
an angle of u such units is equal to the cosine of the real number u and the 
sine of an angle of wu such units is equal to the sine of the real number uw. This 
unit is called the radian and it should be obvious that there are 27 radians 
in an angle of one revolution. Another unit, probably more familiar to the 


: Se 1 : 
reader, is the degree, which is 34,5 of a revolution and —— of a radian. Thus 


the cosine of an angle of d degrees is equal to the cosine of the real number 
wd 


180 
PROBLEMS 
1. Find the values of 
(a) sin(—7) and cos(—7) (d) sin (2) and cos G ) 
(b) sin(#i) and cos(27) (e) sin G ) and cos (3) 


(c) sin (>) and cos eo (f). “sin (= =) and cos (= a) 


2. Make a table like the one below showing the sign of cos ¢ and sin ¢ in each of 
the four quadrants. Put + or — in each entry of the table. 


3. Find all values of ¢ such that 


(a) sn¢ =) (e) sint = —1 
(db) cosr =U (i) cost = +1 
Cy ysiire = (ig) sin? = "2 
(d)* ees'F 2" Ch): “es ZF 


4. What is the domain and range of each of the functions cos and sin? 
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10, 


. If A is an arbitrary integer, find 


(a) cos kr (c) cos & + kr) 
(b) sin kr (d) sin é cs kr) 
: T . Sa . Ae 1 
: Remembering that 4 ++ Pe = 12 and 4 ee 6 = 12 » find 
so9 
(a) sin =a (c) sin 5 
Tv 
(b) cos = es 


. If fis a function with the property that f(t + 27) = f(#), for every real number 


t, show from this that 
(a) f(t — 2r) = f(t), for every real number ¢. - 


(b) f(t + 2xn) = f(t), for every real number f and every integer n. (Use 
induction.) 


. (a) Use (1.5) to write a formula for cos 2a in terms of cos a and sin a. 


(b) Similarly, use (1.7) to write a formula for sin 2a. 


, eas a a 
(c) Write a formula for cos a and another for sin a in terms of cos : and sin a 


. Use the formula for cos 2a [Problem 8(a)] and the identity 1 = cos’a + sin’a 


to derive a formula for 
(a) cos?a in terms of cos 2a (b) sin?a in terms of cos 2a. 


Let f be a function which is periodic with period 27, 1.e., f(t + 27) = f(a), 
and suppose that the graph of ffor 0 < t < 27 is as shown in Figure 5. Draw 
the graph of ffor —2r < t < 6m. 


Figure 5 
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2. Calculus of Sine and Cosine. The formulas for the derivative and integral 
of the functions sin and cos follow in a straightforward way from one 
fundamental limit theorem. It is 


(2.1) 


Proof. It is convenient first to impose the restriction that t > 0 and prove that the 
limit from the right equals 1; ie., 


Hy = eT, es 


Since, in proving (1), we are concerned only with small values of +, we may 
assume that t < = Thus we have 0 < ¢ —. and, as a consequence, 


sin t > 0 and cost > 0. Let S be the region in the plane bounded by the 
circle x? + y? = 1, the positive x-axis, and the line segment which joins the 
origin to the point (cos ¢, sin f); 1.e., S is the shaded sector in Figure 6. Since 


y-axis 


(cos f, sin f) 
br ! 


Figure 6 


the area of the circle is 7 and the circumference is 27, the area of S is equal 
t : ; : ; ; 
to —-rT = - Next, consider the right triangle 7; with vertices (0, 0), 


(cos ¢, sin rf), and (cos rf, 0). Since the area of any triangle is one half the base 
times the altitude, it follows that area(T;) = $ cost sint. The line which 


sin ¢ 
passes through (0,0) and (cosf, sins) has slope Le and equation 
COs 
sin ¢ 


se Setting x = 1, we see that it passes through the point 
COS 
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sin f :; ' ; : 
(1 sn) , as shown in Figure 6. Hence if 7» is the right triangle with 
COS 


vertices (0, 0), (1. sii ) , and (1, 0), then 
cos t 


1 sin f 1 sin ¢ 
ie ee i ay Poni pan thie tee 
aregt ta) pA : cos ¢ 2 COE? 


Since 7; is a subset of S and since S is a subset of 7, it follows by a funda- 
mental property of area [see (1.3), page 171] that 


area(T;) < area(S) < area(T»). 


Hence 
1 t 1 sin f 
—cosfsin¢tf<=-< = 
Z T pu 2 Cost 
2 
If we multiply through by —— , we get " 
sin f 
t 1 
or. = SF 
sit T cast 


Taking reciprocals and reversing the direction of the inequalities, we obtain 
finally 
1 sin f 

— > — > cost. 24 

COS [ t 
With these inequalities, the proof of (1) is essentially finished. Since the 
function cos is continuous, we have lim cost = cos0 = 1. Moreover, the 

t—0+ 
1 


limit of a quotient is the quotient of the limits, and so lim —— = - = l. 
t—>0+ COS 1 


sint .. = 
Thus = lies between two quantities both of which approach 1 as tf ap- 
proaches zero from the right. It follows that 


; gin ¢ 
lim —— = 1. 
t—>0+ t 


It is now a simple matter to remove the restriction ft > 0. Since 
wy —sint -emt—/ 


f sa: ae 3 


, we know that 


sinf — sin|f| 
Re SS i rg ee a > 
, A (3) 


As t approaches zero, so does ||; and as |r| approaches zero, we have just 
proved that the right side of (3) approaches 1. The left side, therefore, also 
approaches 1, and so the proof is complete. 
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It is interesting to compare actual numerical values of ¢ and sin t. 
Table | illustrates the limit theorem (2.1) quite effectively. 


TABLE | 

t sin f 
0.50 | 0.4794 
0.40 | 0.3894 
0.30 | 0.2955 
0.20 | 0.1987 
0.10 | 0.0998 
0.08 | 0.0799 
0.06 | 0.0600 
0.04 | 0.0400 


0.02 | 0.0200 


A useful corollary of (2.1) is 


Cia) 
—— za . 1 —cost. 
Proof. Using trigonometric identities, we write —-————— in such a form that (2.1) 
is applicable. f 
2 t seas t 
‘ cos 5 + sin oe 


COS f 


COs Se Ot ee eae 
ge lop pse 2 2 
: t 
Hence 1.— cost = 2 sin? 3.208 


1 — cost 


oo 
= — sin 
t t 


t ; 
As t approaches zero, 2 also approaches zero, so, by (2.1), the quantity 


eu. : ; ee 
approaches 1. Moreover, sin is a continuous function, and therefore sin : 


approaches sin0 = 0. The product therefore approaches 1-0 = 0, and the 
proof is complete. 
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In writing values of the functions sin and cos, we have thus far avoided 
the letter x and have not written sin x and cos x simply because the point on 
the circle x° + y” = 1 whose coordinates define the value of cos and sin has 
nothing to do with, and generally does not lie on, the x-axis. However, when 
we study sin and cos as two real-valued functions of a real variable, it is 
natural to use x as the independent variable. We shall not hesitate to do so 
from now on. 


EXAMPLE |. Evaluate the limits 


tng”) Him.) mS 


We evaluate the first two limits by writing the quotients in such a form 


sin 
that the fundamental trigonometric limit theorem, lim oe 1, is ap- 
plicable. For (a), =o 
sin3x _ sin3x Tx 3 
we 3 tin Tx 7 
gifl-3x 
As x approaches zero, so does 3x and so does 7x. Hence approaches 


1, and 


7x (= 1% 


—1 
= : ) approaches 1~' = 1. We conclude that 
x 


sin 7X 


sin 3x 
wien 1X 


oe) () 
| 
co 


To do (b), we use the identity cos?x + sin?x = 1. Thus 


2 o9 A 
l1—cosx sin x os es 
—————— = —— = sinx —— : 
x x x 
; sin x 
As x approaches zero, sin x approaches sin 0 = 0, and approaches 1. 
Hence 
1 — cos’x 
hi =)- |=) 
x—-0 


For (c), no Jimit exists. The numerator approaches 1, and the denominator 
approaches zero. Note that we cannot even write the limit as +« or —x&% 
OS x 


takes on 


: : sos ; C 
because sin x may be either positive or negative. As a result, 


both arbitrarily large positive values and arbitrarily large negative values as 
x approaches zero. 
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“ BRSP La. 
We are now ready to find oP sin x. The value of the derivative at an 
bs 


arbitrary number a is by definition 


(4 sin x) (ay = Tim Simer toi) “a SUH | 


dx {+0 i 


As always, the game is to manipulate the quotient into a form in which we 
can see what the limit is. Since sin(a + ft) = sinacost-+ cosasint, we 
have 

sinfa + 1) — sina sin acos t + cosa sin f — sina 


t t 


I 
Q 
€ 
N 
S 
a 
ee 
| 
a 
pe 
S 
| 
| 
| 


As t approaches 0, the quantities cosa and sina stay fixed. Moreover, 
sin ¢ 1 — cost ; 
is approaches 1, and ec approaches 0. Hence, the right side of the 


above equation approaches (cos a): 1 — (sina): 0 = cosa. We conclude 
that 


(4 sin x) (a) = 608 a, for every real number a. 


Writing this result as an equality between functions, we get the simpler form 


(2.3) £ sin 3 = COs x. 


The derivative of the cosine may be found from the derivative of the 
sine using the Chain Rule and the twin identities cos x = sin (; _ x) and 
sin X = Cos (: — x) [see (1.6), page 286]. 


Fe = oo ety 
a oe 2 i 


Writing this result in a single equation, we have 


| 

Q 

S 

1? 2) 
Ye 
Naa 

| 

~< 
Wei saaie 
Sle 
Pail 
NI] >A 

| 
hag cea 


I 
GC) 
ie) 
N 

| 

| 
* 

—— 
oo 
| 
, 
| 

| 
= 
~ 
> 


(2.4) 
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EXAMPLE 2. Find the following derivatives. 


d . > d e t 
(a) a sin(x + 1), (c) qj sne> 

d d : 
(b) Ax 68 cm, (d) PS In(cos x)’. 


These are routine exercises which combine the basic derivatives with the 
Chain Rule. For (a) we have 


& sin(x* + 1) = cos(x® + 1) 4 (x? + 1) = 2xcos(x* + 1), 


The solution to (b) is 


# cos 7x = —sin 7x = 7x = —7Tsin 7x. 
dx dx 
For (c), 
— sine’ = cos nee e' = e' cose’ 
dt dt ; 
and for (d), 
d a l d 2 
oe In(cos x) = (ous wide (cos x) 
A cette 2 COS x cs COS Xx 
~ (cos x)? dx 
ee a 
a COS x 


Every derivative formula has its corresponding integral formula. For 
the trigonometric functions sin and cos, they are 


| si xdx = —cosx+c, 


(2.5) 
[cos x dx sinx + c. 


296 TRIGONOMETRIC FUNCTIONS | [cHAp. 6 


The proofs consist of simply verifying that the derivative of the proposed 
integral is the integrand. For example, 


d d 
Ae OS® +c)=- Ax COS ¥ = Sin x. 
EXAMPLE 3. Find the following integrals. 


(a) | sin 8x dx,  (b) | x cos(x’). dx, =? (ec) | cos’x sin x dx. 


The solutions use only the basic integral formulas and the fact that if 
d. 

Fo =f then [ro ole = F(u) +c. Integral (a) is simple enough to 
x 


write down at a glance: 
| si 8x dx = —gZcos8&x +c. 


d 
To do (b), let w =-x’*.* Then = = 2 oe 
x 


| cos(x*) dx = 4 | (costs'))28 dx 
du 
ae mi 
= ¥ | (eos u) A 
= ¢sinu+c 
= isin(x’) +c. 
du ; 
For (c), we let uo = cos x. Then — —sin x. Hence 
x 
| cos's sinxdx = — [ costs(—sin H-dx 


! 

| 
in 

a 

a 


| 
| 
|- 
Q 
© 
N 
fer) 
ya 
(ga) 


The graphs of the functions sin and cos are extremely interesting and 
important curves. To begin with, let us consider the graph of sin x only for 


C= se ; . A few isolated points can be plotted immediately (see Table 2). 
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TABLE 2 
x y = Six 
0; 0 
| 2 
6 2 
: $\/2 = 0.71 (approximately) 
: 4/3 = 0.87 (approximately) 
4 
Z 


d 
The slope of the graph is given by the derivative, = sin x = cos x. At the 
ix 


Cie us us 
origin it is cos 0 = 1, and, where x = 5 the slope is e855; =-0.-Since 


ee PO 2d Poo ¢ ees 
dx 2 


we know that sin x is a strictly increasing function on the open interval 
us = ; : 
(0 | . In addition, there are no points of inflection on the open interval 


and the curve is concave downward there because 


9 


me | : 
ne = cod x — sin 0 ifi-<6a< 
dx? dx 2 
On the other hand, the second derivative changes sign at x = 0, and so there 
is a point of inflection at the origin. With all these facts we can draw quite an 
accurate graph. It is shown in Figure 7. 


Figure 7 
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It is now a simple matter to fill in as much of the rest of the graph of 
sin x as we like. For every real number x, the points x and z — x on the real 


number line are symmetrically located about the point - The midpoint 
, x Sen ; 
between x and 7 — x is given by xt as a - . As x increases from 


T T 
0 to 7 the number 7 — x decreases from 7 to 5° Moreover, 


sin(a — x) = sin cos x — cos7 sin x 


O-cosx — (—1):sinx 


= sin x. 


It follows that the graph of sin x on the interval E r| is the mirror image of 


the graph on [o, “| reflected across the line x = : . This is the dashed curve 


in Figure 7. Now, because sin x is an odd function, its graph for x < 01s 
obtained by reflecting the graph for x > 0 about the origin (i.e., reflecting 
first about one coordinate axis and then the other). This gives us the graph 
for —r <x <x. Finally, since sin x is a periodic function with period 2r, 
its values repeat after intervals of length 27. It follows that the entire graph 
of sin x is the infinite wave, part of which is shown in Figure 8. 


Figure 8 


The graph of cos x is obtained by translating (sliding) the graph of sin x 


to the left a distance 5 This geometric assertion is equivalent to the algebraic 


us ee 
equation cos x = sin € + s). But this follows from the trigonometric 


sin € fe *) 


identity 
Gy 


2 


e Tt e 
sin x COs 5 + cos x sin 


= (sin x4-0 => (03 x) 1 
COs x. 
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The graphs of cos x and sin x are shown together in Figure 9. 


Figure 9 
PROBLEMS . 
i. Evaluate the following limits. 
ea * 
sin t sin 2f 
lim —— f) lim—— 
(a) se t? i ao sin 3f 
jn oi | 
t— 0 t xr— mT /2 5 pane x) 
2 
(¢)- lam Jas = (h) lim cas 
r—() x* r— 0 x 
» 3m? ee on 2 
(4) tim == i 
r— 0) xX £510 yi 
__ sin(r — 5 ries deme 
gus Te Oe) r>o xX SINX 


2. Find the derivatives of the following functions. 


(a) sin(x’ + 3) (f) Insin x 
(b) cose (gz) sin®x? 
(c) cosrsint (h) os 
COS x 
(d) cos'x + sin*x (i) ses 
sin x 


(e) cos(sin x) iy 2 ein. 
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3. Evaluate the following integrals. 


(a) | cos 7x dx (f) | (cos f)cos(sin fr) dt 
(b) | (cos 2x + sin 3x) dx (g) | = “ dx 

(c) | e"cos e"dx (h) | cos’x sin x dx 

(d) | sin(x + a) dx (i) | sin’x cos x dx 

(e) | (cos x)e*'" "dx (j) | (cos’x + sin’x) dx 


4. Find the integrals 
(a) | cos" x sin x dx (b) | sin"x cos x dx. 


The next two integrals can be reduced to sums of integrals of the forms (a) and 
(b) by using the identity sin?x + cos?x = 1. 


(c) | sin’ x dx (d) | cos‘x sin’x dx. 
5. Express cos’x in terms of cos 2x, and thence evaluate {cos?x dx. 
6. Express sin*x in terms of cos 2x, and thence evaluate fsinx dx. 
7. Solve the differential equations 
dy ee dy. sn x dy COS x 
a) -—-=esmne b) — = — Cc eS : 
(a) dx : (b) ax. COSy (c) oe sin(y?) 


8. Evaluate the following limits using L’H6pital’s Rule. 


. t 

Gy tin | aia. = 

ey oe i350 

_ 1 _ €@€ —1— 
(b) lim tee (e) lim = : He 

xr—0 x—0 Xx 

_ sin’x oe Ses se 
(c) lim — (f) lm ———— 


9. Draw the graphs of the equations 


sin(27x) 


2 sin (* +- a 


(a) += 3 sm > icy.» 
T 
(b) -y-=-cos G x) 


| 


Qu 
See’ 
bee 

| 
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3. Other Trigonometric Functions. The other trigonometric functions are 
the tangent, cotangent, secant, and cosecant. They are abbreviated tan, cot 
(or ctn), sec, and csc, respectively, and the definitions are 


sin x l 
nx = SeCG. Xe 

COS x COS x 

COS x l 
cotx = — csc x = —~— > 

sin x sin x 


Unlike sin and cos, these functions are not defined for all real values of x 
since the denominators in the defining expressions are zero for some values 
of x. The set of all solutions to the equation cos x = 0 is the set consisting 


of all odd multiples of ‘ . Hence tan x and sec x are defined if and only if x 


is not an odd multiple of 5 . Similarly, cot x and csc x are defined for all real 


numbers x except integer multiples of 7. 

Although sine and cosine were first defined with a domain of real num- 
bers, we have shown that they also can be considered as functions with a 
domain of angles. Since the other four functions are defined in terms of sine 
and cosine, they may also be regarded as functions with a domain of angles. 
Thus it makes sense to speak of the tangent of the angle a, written tan a, 
and of the cosecant of an angle of 30°, written csc 30°. The former is defined 


if and only if the radian measure of a is not an odd multiple of , or, alter- 
natively, if the degree measure of a is not an odd multiple of 90. The latter is 


the reciprocal of sin 30° and is equal to — = 2. 


Dol | ee 


Two useful trigonometric identities, which are simply alternative state- 
ments of the basic equation cos’x + sin’x = 1, are derived as follows: 
Dividing first by cos?x, we have 


I I I 
2 Ze 
l fanx 6s) Set xX 


hence 1 + tan*x = sec’x. On the other hand, if we divide by sin?x, we have 


2 wi 2D) 
cose ere 24 
sintx  sin?x  sin2x 
I | I 


2 2 
cot'x l csc’ x, 
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and so cot?x + 1 = csc*x. Summarizing, we write 


(3.4) 


Another formula which we shall find useful is that for the tangent of the 
difference of two numbers, a — 5, in terms of tana and tan b. First, 


tan(a — 6) = —>——_+_ = 


Dividing both numerator and denominator by cos acos b, we get 


sin acos b cos asin b 
cos acos b cos acos b 
cos acos b sin a sin b 


tan(a — b) = 


cos acos b cos acos b 


sin a sin b 


cos a cos b 


sina sin 

cosa cos b 
Hence 

tana — tanb 
(3.2) tan(a — b) = sant re oer rere 


The trigonometric identities developed in this section are handy tools, 
and we shall ‘not hesitate to use them. In themselves, however, they are of 
secondary importance. Any one of them can be derived quickly and in a 
completely routine way from the basic identities in sin and cos derived in 
Section 1. 

An important application of the tangent function is in connection with 
the slope of a straight line. We define the angle of inclination « of a straight 
line L as follows: If L is horizontal, then a = 0. If L is not horizontal, then 
it intersects an arbitrary horizontal line H in a single point P. Let a be the 
angle with vertex P, initial side the part of H to the right of P, terminal side 
the part of L above P, and whose measure in radians satisfies the inequality 
0 <a < a (see Figure 10). We contend that 


(3.3) The slope of a line is equal to the tangent of its angle of inclination. 


Proof. We refer again to Figure 10. The given line is L, its inclination is a, and its 
slope is m. If L’ is drawn through the origin parallel to L, then L’ also has 
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inclination a and slope m. Furthermore, the point (cosa, sina) lies on L’. 
Since (0, 0) also lies on L’, the definition of slope yields 


which completes the proof. If L is vertical, its slope is not defined. But then 


the angle of inclination is : and tan : is not defined either. 


Figure 10 


Figure 11 


The importance of (3.2) is apparent when we try to determine the angle 
between two nonvertical intersecting lines. Let a be the angle of inclination 
of one line and 8 the angle of inclination of the other. For convenience, we 
assume that a > B. It follows that 0 < a — @ < m and, from Figure 11, 
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that the difference a — 8 is an angle between the two lines. We denote the 
slope of the first line by mm, and that of the second by m,. That is, we have 
Ay, = tan @ and ms-= tane.. By.G.2), 

tana — tan 6 m1 — Ms 


ee OS ee 


If this number is positive, then a — @ is the acute angle between the lines. If 
this number is negative, then a — £ is the obtuse angle between the lines. If 


this number is undefined, then a — 6 = 3 and the lines are perpendicular. 


The number Ras cic is undefined if and only if 1 + mym,. = 0. Since this 
WM» 


equation is equivalent to myn. = —1, it follows that we have proved the 
statement made on page 44 that two nonvertical lines with slopes m,; and m, 
are perpendicular if and only if myn. = —1. 

The formulas for the derivatives of the remaining four trigonometric 
functions are found using the derivatives of sin and cos together with the 
usual rules of differentiation. They are 


2 
sec x. 


—csc’x, 
(3.3 ) 


sec x tan x, 


—csc x cot x. 


Proving the first of these, we have 


gee : d 
cos x a sin x — sinx a eos 


d d sinx a d. 
= 1 eS ee = 
dx dx cos x COs2x 
2 aa 
cos x + sin’x ] 9 
SS ee Se ee SEC LY. 
COs2x Cos2x 


The others are left as exercises. We observe the following mnemonic device. 
From any one of the six formulas for differentiating trigonometric functions 
another one is obtained by adding the prefix “‘co”’ to every function which does 
not have one, removing the prefix ‘‘co” from each function which has it 
already, and changing the sign. For example, this procedure transforms the 


d d 
equation = sec X = sec x tan x into ae csc Xx = —csc x cot x and trans- 
IX LX 
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d oe 
forms PR = —sin x into Fae = cos.x. Hence the number of 
is x 


derivative formulas which need to be memorized can be cut in half. 
The integrals corresponding to the above derivatives are 


2 
| sec xdx = tanx+c, 


ese’x dx —cotx+c, 
(3.4 ) 


esc x coteedx —cscex +c. 


[sce x tan x de secx + ¢, 


EXAMPLE |. Find the following integrals: 
(a) | msec Oc + Dax, 
(b) | / tan x dx, 
(c) [eset cot’x dx. 


d 
In (a) we observe that = (x* + 1) = 3x’, or, equivalently, 
x 


Hence 


C 


| sect’ + 1)dx =i | [sec'(x + 1)] ke oe = thd 
= +tan(x + 1)-+ c. 


sin x 


For (b), we have [ran xaxX = | 
COS x 


and so 
l 
fran x ax jan - [ite 
Cos u dx 


= —Inlu| + c = —In|cos x| + c. 


du 
dx. Iiu = cos.x, then— = —sin x, 
dx 
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d 
Finally, to do (c), we see that since es cot x = —csc*x, the integral is, except 
x 


for a minus sign, of the form fe “ dx. Thus 
x 


I 


d 
ea | cesc’x cot’x dx = — | (cot’x) Fy COt X dx 
= —lcot’xtc. 


Each of these integrals can be checked by differentiation. 


The graph of tan x is an interesting curve, which we now describe. 
Note, first of all, that tan is an odd function, 


WanxX +7) = ——— eee ee 
dil cos(x + 7) cosxcoSm — sinx sinw 
—sin x 
= ———— = tanx 
—Ccos x 


Thus tan is a periodic function with period 7. The slope of the graph is given 
by the derivative, 


d nx = sec’x 
‘ dx : 


which is positive for every value of x for which tan x is defined. Hence 
; ' ee : 1 7 
tan x is a strictly increasing function in the interval — 5 <— Ks 5° From the 
Be sin x st 
definition, tan x = ——., we see that tan x is positive when both functions 
COS Xx 
an we aan 
are positive, as they are for 0 < x < 5 - is zero when sin x = 0, as it is for 
x = 0; and is negative when the two functions have opposite sign, as they 
TT - : 
eo tor = 5 <x <0. We also see that tan x takes on arbitrarily large 
= us 
positive values as x approaches 5 from the left, since sin x approaches | and 


cos x approaches 0. Thus 


lint. 140 X=: oo. 


xT | 2— 
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The second derivative is given by 
ene ae 
ee cae? tan x = 1. ae), 


dx? 


wn a ©, 


=-() FO < x<h* 


from which it follows that the graph is concave downward for — ; ee 
concave upward for0 < x < ; , and has a point of inflection at the origin. 


Combining all these facts with the few isolated values shown in Table 3, we 
obtain the graph shown in Figure 12. 


y-axis 
y = tan x 
_3a _ ae 7 be 3m oe X-axis 
2 2 2 2 
Figure 12 
TABLE 3 
| dy : 
xX | y = tanx —— = sec-x 
dx 
0/0 l 
T l 
| = =f) e (approx.) | = =. 1.33 (approk.) 
6) V3 
T 
rae | Z 
4 
T 2 
3 4/3 = 1.73 (approx.) | 4 
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The graph of cot x can be obtained in the same way in which we worked 
out the graph of tan x. However, there is a quicker way based on an identity. 
Since 


5 Tv - vin < Vv 
sin (x ote = sin X COS 5 + cos x sin x 


1 2 
tan (x ae = = ™\ 7 ae 
cos (x ~t. 7) COS X COS 5 _ sin x sin > 
COS X 
= = COL x, 
—sin x 


we know that 
cot x = —tan (x = 7) : 
The geometric significance of this identity is that the graph of cot x is ob- 


tained by translating (sliding) the graph of tan x to the left a distance ; and 
then reflecting about the x-axis. 


is l ke 
From the definition, sec x = ae it is apparent that 
COS x 


sak l I if n is even, 
CS SS ; : 
COS Nr —| if n is odd. 


If ais any odd multiple of ; ~then cos a = 0, and so 


l 
lim |sec x| = lim ———- = a 
40 za |COS x| 


y-axis 


Figure 13 
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Moreover, on an interval where one function is increasing, its reciprocal 
function is decreasing, and vice versa. It follows that the over-all shape of the 
graph of sec x can be ascertained quite easily from the graph of its reciprocal 
function cos x. The graph of sec x is shown in Figure 13. 

The graph of csc x is related to that of sec x in the same way as the graph 
of sin x 1s related to the graph of cos x. 


PROBLEMS 


1. Which of the six trigonometric functions are odd functions and which are even 
functions ? 

2. Derive the formulas for the derivatives of the functions cot, sec, and csc. 

3. Find the following derivatives. 


d 2 d 2 
(a) 7x 8 * (e) = (sect — 1) 
(b) att ata) (f) = esc(x® — 1) 
dx dx 
d d 
(c) sa In|sec x| (g) a tan x cot x 
d d 
(d)} 700s y tan) (h) 3 In|cot f]. 


- 


4. Prove each of the following identities from the basic identities in sine and cosine 
developed in Section 1. 


_ tan x + tany _ cot acoth ~ 1 
ety) = 1 —tanxtany Se cota + cot b 
fy esc x — sec (x — a (d) cot(x + m) = cotx. 


5. Find the following integrals. 


(a) J ron ax OX 


(b) | cot x dx 

(c) | e"sec e' dx 

(d) | tan°x dx [Hint: Use tan?x + 1 = sec?x.] 
(e) | tan’x sec’x dx 
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ivy, 


rf. 
FZ. 


7, 
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(f) | sec! tan x dx 


(g) | = csc\/ x cot\/ x dx 


(h) | csc x dx. 
d 


. Find ce (sec x + tan x) and use the result to evaluate the integral 


d. 


sec x + tanx 
sec x dx = sec x —— — dx, 
sec x + tan x 


tan x sec snore du . 
. Find f tan x.dx.= see ae dx by substituting wu = sec x and 7 in the 


sec-x 
right side. Compare the answer obtained with Example 1(b). 


. Draw the graph of (a) cot x, (b) csc x. 
. Evaluate the following limits. 


ay ieee (6) ieee 


z—0 x—0 

Find the tangent of the angle between 

(a) the straight lines y — 2x = 1 and 2y — x = 4. 

(b) the straight lines y + 2x = 2 and 2y — x = 2. 

(c) the tangent lines to the curves y = x* and x? + jy? = 1 at their point of 
intersection in the first quadrant. 

What is the domain and the range of each one of the six trigonometric functions ? 


Evaluate each of the following indeterminate forms (see Problem 13, page 271): 


(a) lim (sin x)"* (i nag Te. 
z—0+ z—0-+ 


If lim f(x) = lim g(x) = +, it is not immediately apparent whether or not 
lim (f(x) - g(x)) exists. Such limits are commonly called indeterminate forms 


of the type « — «. The usual method of evaluation is to express the difference 


f(x) — g(x) as a quotient and then to try to find its limit. For example, we 


write 


and, as x approaches zero, the limit of the right side can be obtained by two 
applications of L’H6pital’s Rule. Evaluate 


BD 2 1/3 
(a) lim ( = ‘) (b) lim Les 2 = i 


oer : x xr—0 
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© tim(=HE+5 _ 2) ©) tim (1 + Ins) 

r—0 sin x x z—0+ \X 

(d) lim (cot _ 122) (f) lim (sec x — tan x). 
t—>0 Tae 


4. Inverse Trigonometric Functions. The function sin does not have an 
inverse function. The reason is that it is perfectly possible to have a ¥# b 
and sina = sinb. Another way to reach the same conclusion is to consider 
the equation x = sin y. Its graph is the curve in Figure 14. It does not 


Figure 14 


define a function of x because it does not satisfy the condition in the definition 
of function (see page 14) which asserts that every vertical line intersects the 
graph of a function in at most one point. 


; T 1 ce 
However, on the interval | a 4 the function sin is a strictly increasing 


function. Hence, although sin does not have an inverse, it follows by 
Theorem (2.4), page 250, that the function sin with domain restricted to 


oe | a eee * : : 
| a | does have an inverse. This inverse function is denoted by either 


sin! or arcsin, and in this book we shall use the latter notation. Thus 


y= arcsinx if and only if 
Tv 


= 


and — 7S y & 
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The graph of the function arcsin is shown in Figure 15(b). It is that part 
of the graph of the equation x = sin y for which y satisfies the inequality 
— 5 ee a : . Note that the graph of arcsin is obtained from the graph of 
the restricted function sin by reflection across the diagonal line y = x. It 
follows both from the definition of arcsin and also from the illustration that 


X-axis 


y = arcsin x 


(a) (5) 


Figure 15 


the domain of arcsin is the closed interval [—1, 1] and the range is the closed 


= i| T 4 
nterval| — =>» =|: 
interv —— 


It is a consequence of Theorem (3.4), page 261, that the function arcsin 
is differentiable at every point of its domain except at —1 and +1. [In 


applying (3.4), let f be the function sin restricted to | a |. and then 


ff = arcsin.] We may compute the formula for the derivative either 
directly from (3.4) or by implicit differentiation. Choosing the latter method, 


d 
we begin with y = arcsin x and seek to find om li y= aresm x, “then 
x = sin y, and so x 


sin y or Lease y.- 


Hence 
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To express cos y in terms of x, we use the identity cos*y + sin*y = | and the 
fact that x = siny. Hence cos’y + x? = 1, and therefore 


cosy = Al = 


However, y is restricted by the inequality — 3 y= 5 , and in this interval 


cos y is never negative. Hence the positive square root is the correct one, 


and we conclude eee sud =. Thus 


— Vl— 


l 


(4.1 ) aresin x = 


EXAMPLE 1. Find the domain, range, and derivative of each of the composite 
functions 


(a) -aresin — (b) arcsin(In x). 


ss 


- is defined for every real number x and also satisfies the 
2 


Nt 
The quantity a 


< 1. Hence, arcsin ; ; 1s defined for every x; 


| 
i+x 
i.e., its domain is the set of all real numbers. The range of the function 
| 
| an 
Figure 15(b) that the function arcsin maps the interval (0, 1] on the x-axis 


inequalities 0 < 


however, is the half-open interval (0, 1]. It can be seen from 


onto the interval { 0, : on the y-axis. It follows that the range of the com- 


is the half-open interval (0 ‘|. The derivative 1s 
found using (4.1) and the Chain Rule: 


position arcsin 


2 


ee ee eee 
dx . eee ae NT + x 
(7 te a) 
ce l ie ae 
(1 as Pe J a l ¢ = x 
+22 
ania —2x 
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For any particular value of x, the quantity arcsin(In x) is defined if and 
only if In x is defined and also lies in the domain of the function arcsin, which 
is the interval [—1, 1]. Thus x must be positive, and In x must satisfy the 


es oe 
inequalities —1 < Inx < 1. Hence x must satisfy - < x < e. The domain 


; 1 
of arcsin(In x) is therefore the interval ; ; | , and the range is the same as 
e 


that of arcsin, i.e., the interval | : 5 | . The derivative is given by 
| d l 
ae arcsin(In x= Pes arena ee Inx = ——___—_- 
dx /1 = Gaye & xV/1 — (In x)? 


The integral formula corresponding to (4.1) is 


dx 
(4.2 ) {+S = arcsinx +c. 
aft = 
EXAMPLE 2. Find oe The first thing we do is to write the de 
. Fin Spee | - 
“ve =e 
nominator, as closely as possible, in the form \/1 — w?. 
I l I 


vee a) eG 


Hence, letting u = 


d 
we have = x and 


x dx 


-3/ x dx “bf | du 
f/f — x 2. eV A) SP po 
1 — (> 


By (4.2), 


du 
— xi arcsin u + Cc. 
a — ye ax 


Finally, substituting ~ . for u, we obtain 


| x dx 1 oS 
Se ee 
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The function cos does not have an inverse for the same reason that sin 
does not. However, a partial inverse can be obtained, just as before, by 
restricting the domain to an interval on which cos is either increasing or 
decreasing. Any such interval can be chosen. With the function sin it was 
natural to choose the largest possible interval containing the number 0—the 

; Tv men 
closed interval | 7 | . With cos the choice is less obvious. However, we 
shall select the interval [0,7], on which cos is strictly decreasing [see 
Figure 16(a)]. The function cos with domain restricted to [0, 7] then has an 


y-axis 


y-axis 


y = arccos x 


econ x, Ole xe @ 


(a) (5) 
Figure 16 


inverse, which is denoted cos? or arccos. As before, we shall use the second 
notation. Thus 


y= arcoos x “if and only if "x = "tos y 


and @< yp & 


The graph of arccos is shown in Figure 16(b). 
It should come as no surprise that the two functions arcsin and arccos 
are closely related. In fact, 


. TV 
(4.3 ) wen xX = — arccos x. 


T 7 T 
Proof. Let y = arccos x. Then arccos x = oe y,and so x = cos (; 2. r). 
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Hence 
T 1 a ee : 
x = cos G — y) = COM, Cony -- eas = sin y. 


Since 0 < arccos x < 7m, it follows that —ma < —arccosx < 0 and hence 


™ _ arccos x < = or, equivalently, — 5 — ys or This, together 


= 
a 
me NO 
ant] 
Il pol 


sin y, implies that y = arcsin x, and the proof is complete. 


Note that the validity of (4.3) depends on our having chosen arccos so 
that its range is the interval [0, 7]. 

It follows from (4.3) that the derivative of arccos is the negative of the 
derivative of arcsin. Thus 


(4.4 ) £ arccos x = — : 


ax Gt = x2 


iS 


I 
From (4.4) we see that another indefinite integral of SiS a 
ss 


—arccos x. 
Obviously, not one of the six trigonometric functions with unrestricted 
domain has an inverse. The function tan with its domain restricted to the 


see oe ; 
open interval (- = *) is strictly increasing and so has an inverse function, 


which we denote tan‘ or arctan. 


y = afttan x if and only if xX =n 


Tv 


2: 


Tv 


and — D 


oe 


The graph of arctan is obtained by reflecting the graph of tan with 
domain restricted to (- : ; 5) across the diagonal line y = x. It is shown 
in Figure 17(b). As can be seen from Figure 17(a), the function tan maps the 
open interval (- 5° *) onto the entire set of real numbers. That is, for 


2 


. 5 T UE 
every real number y, there exists a real number x in (- ae | such that 


y = tan x. Hence the domain of arctan is the whole real line (—x%, ~). 


The range is the interval (- : ; *) 
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TT 
<a ae 


TT 
y = tan x, 3 5) 


y-axis 


NIA 


y = arctan x 


(a) (b) 
Figure 17 
It is a corollary of Theorem (3.4), page 261, that arctan is a differen- 


tiable function. We compute the derivative by implicit differentiation. Let 
y= arctan x. then x = tan y, and 


= Se 
ao tan y or | = Ree) 5: 
Hence 
Bee 
dx sec®y 


From the identity sec?y = 1 + tan’y, we get sec’'y = 1+ x°. It follows 


dot = : >. Dous 
a f+ x 
g retan ee 
(4.5) ooo eee 


The corresponding integral formula 1s 


dx 
( 4.6 ) ise = arctan x + ¢. 
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1 
ies dx ’ 
EXAMPLE 3. Compute the definite integral | ue, We get immediately 
0 x” 
1 d i 
| i = arctan x| = arctan! — arctan 0. 
¢ tae 0 


Since tan 0 = Oand tan 5 = |, weknowthat0 = arctan 0 and 7 = arctan 1, 


1 
ts? 
0 Lae A 


This is a fascinating result: The number z is equal to four times the area 


Hence 


l 
bounded by the curve y = ex the x-axis, and the lines x = O and 
ao se 


The function cot is strictly decreasing on the open interval (0, 7). With 
its domain restricted to this interval, cot therefore has an inverse function, 
which we denote cot! or arccot. The relation between the two functions 
arccot and arctan is the same as that between arccos and arcsin, 


(4.7 ) arctot x = — arctan x. 


The proof is analogous to the proof of (4.3) and is left to the reader as 
an exercise. It is a corollary that the derivative of arccot is the negative of the 
derivative of arctan. Hence 


Nt 


Botemcs GE) Se 
eee dx Te 


poem I 
Here again, we see another indefinite integral of oat the function 
= 2rCcot xX. si 


The union of the two half-open intervals 0, 5) and (5, | consists 


of all real numbers x such thatO < x < wandx + ; . It can be seen from 
the graph of the equation y = sec x in Figure 13, page 308, that if a and b 
are two numbers in the union [o, 5) U @ | and if a #4 }), tiem 
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Orat 
sec a ~ sec b. We omit — because the secant of that number is not defined. 


It follows that the function sec with domain restricted to 0, =) U (5 ; | 


has an inverse, which is denoted sec‘ or arcsec. Thus 


y = arcsecx ifandonlyif x = secy 


and y is in| 0,5 ee 


The graph of the function arcsec is shown in Figure 18(b). Its range is the 


union [o, 4 U (: : | . As can be seen from Figure 18(a), the function sec 


maps the set [o, *) U ( : r| onto the set of all real numbers with absolute 


y-axis 


y = arcsec x 


(b) 


Figure 18 


value greater than or equal to 1. Hence the domain of arcsec is the set of all 
real numbers x such that |x| > 1. 
The derivative can again be found by implicit differentiation. Let 
y = aresec x. Then» = sec y and 
d 
1 


d ee et 
mt = Fy SOCYs which implies = sec y tan y 7 


Thus 
dy = l 


dx  secytany 
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Using the identity sec*y = 1 + tan’y and the equation x = sec y, we obtain 


sec ytany = pny/ ili 
ix > Lien. 2 = = and so sec y tan y is nonnegative. Hence 
sec plany = xn/ x2 = A ina 1. 


On the other hand, if x < —1, then . < y < 7m, and in this case both sec yp 


and tan y are nonpositive. Their product is therefore again nonnegative; 
ie. 
sec ytany = —xV/x? — 1 ifx < —1. 


It follows that both cases are covered by the single equation sec y tan y = 


oe. d 
|x| \/x? — 1. Hence ms = Pee. and we have derived the formula 


(4.9 ) = aresec xX = 


dx 


The final inverse trigonometric function is the inverse of the cosecant 


with domain restricted to the union | - _ 0) U (0 ‘|. This function, 


denoted csc‘ or arcesc, has range equal to | - ‘ : 0) U ¢ : 4 and domain 


equal to the set of all real numbers x such that |x| > 1. The analogue of 
(4.3) and (4.7) is valid. That is, 


(4.10 ) arcsec X = x — Aarccsc x. 


ka 
2 
Proof. The proof mimics that of (4.3). Lers'y = 3 — arccsc x. Then 


arcsec. = : — y, and so x = csc (; — ’) . Thus 
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Since — ‘ <— avecsc xs zs it follows that 0 < — arcesc x <7, Or, 


equivalently, that 0 < y <a. This, together with x = sec y, implies that 
y = arcsec x, and the proof is complete. 


From this it follows at once that 


d 
(4.11 ) 2 SS 
Ix|/x? — I 


dx 


PROBLEMS 
1. Evaluate the following. 
(a) arcsin 1 (i) arcsin(sin a) 
(b) arcsin 5 (j) arctan (tan 7) 
(c) arccos 5 (k) arctan (cot = 
(d) arcsin (- v3) (1) arcsin(cos a) 
(e) arctany/3 (m) tan[arctan(—1)] 
(f) arccot+/ 3 (n) arcsin(2 sin x cos x) 
2 
(g) arcsec (2..) (0) arcsin (sin ‘z) 
\/3 4 
(h) arccsc 2 (p) arctan (cot a) 
— ar 
2; Find re 
(2). y = aresin x” | (f) y = arcsec(1 + x’) 
(6) 9» = arctan a/x (g) y = arcsin(x + 1) + arccos(x + 1) 
Be Say 3 3 
(c} - » = efesm Pa (h) y= arctan x = carecot = 
(d) y = arccos(cos x) G) y = arctan(inx) 
(e) y = arccos(sin x) (j) yy =.arccas (*) — arcsec x. 
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3. What is the domain and range of each of the functions y of x in Problem 2? 
4. Find the following integrals. 


dx 
g | +2 . f* CS 


(b) = (f) if -25s 
af 2 xt s/1 — (x + 1)! 
y dy 
(c) {je (g) ie 
x dx 
ie _ a foes 


5. Prove the identity (4.7). 
6. Find < arccos x by implicit differentiation. 
x 


7. Evaluate the following definite integrals. 


1/2 a V3 . 
x i 
(a) ee (d) ee 
0 4/1 — x 0 V4—P 
“ped slg 
x i 
= e ees 
si 1/24/2x — x ss Ls 
u 
dt V2 
a / 
ete (f) | as 
1 xV/ x? — | 


8. (a) Draw the graph of the function arccot. 


(b) What is the domain and range of arccot? 


(c) Find * cot x by implicit differentiation. 
‘x 


9. Identify the function F defined in Example 1, page 203. 
10. Prove the identities 


u re © 
(a) arcsec x = arccos (‘) (b) arcese x= arcsin (2) 


11. Draw the graph of the function arccsc, and specify its domain and range. 
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12. Prove or disprove 
(a) arcsin(—x) = —arcsin x (c) arctan(—x) = —arctan x 
(b) arccos(—x) = arccos x (d) arccos(—x) = mw — arccos x. 
5. Algebraic and Transcendental Functions. We recall that a real-valued 


function f of one variable is a polynomial if there exist real numbers dp, 
Gi eens iat, for every real number x, 


f(x) = ao + ax +--+ + a,x" = > ayx”. 
k=0 


Thus, among the different functions f defined respectively by 


@) fx)= 6 +1822 Opes 
(b) f(x) = a", @) f= p>" 
Oo (h) fe) = 5, 
ce () fo) =VxF1, 
@ f= anx () fl) = ee — , 


only those defined in (a), (b), (h), and (j) are polynomials, and the rest are 
not. In asserting, for example, that the trigonometric function sin is not a 
polynomial, it is important to realize that we are stating more than just the 
obvious fact that sin x does not look like a finite sum of terms of the form 
a,x". We are asserting that it is impossible to write sin x in this form. The 
easiest way to prove this is to find some one property which every polynomial 
has and which sin does not have. For example, if fis a polynomial, then the 
derivative f’ is a polynomial of degree one less. Hence the jth derivative 


f'? is the constant function zero, if j is chosen big enough. On the other hand, 
a ge : 
the jth derivative os sin x is equal to +cos x or +sin x, and is therefore 
x 


never a constant. This proves that the function sin is not a polynomial. 

Similarly, a real-valued function F of two variables will be defined to be a 
polynomial if there exist real numbers a;;,/ = 0,..., mandj = 0,...,n, 
such that, for every pair of real numbers x and y, 


m 


Foxy) = > Yo aux'y’. 


7=0 7=0 


An alternative formulation, which avoids writing the double sum, is to define 
a polynomial in two variables to be a function which is the sum of functions 
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each one of which is defined by an expression ax'y’, where ais a constant, and 
i and j are nonnegative integers. Examples of polynomials in two variables 
are those functions defined by 

(a) F(x, y) = x8 + 2x*y + xy%, 

(b) G(x, y) = (x + y)yx, 

(c) f(x,y) = 17xy, 

(a) Hip) = TAZ, 

() ARS 

We come now to the two principal definitions of this section. A function 
f of one variable is said to be an algebraic function if there exists a polynomial 


F in two variables such that F(x, f(x)) = 0, for every x in the domain of f. 
A transcendental function is one which is not algebraic. 


EXAMPLE 1. The two functions 


2 
x i ae 
@) f@) = 24: (b) a(x) = VP +2, 
: | xia | 
are both algebraic. To show that fis an algebraic function, let y = ro ae a 


Then y(2x? — 1) =ix? + 1, of, equivalently, 2x y — y—-—2 = = 
Hence if we let F be the polynomial defined by 
AS Mesalt Sok Sisal) Teale. Saree © 


it will be true that F(x, f(x)) = 0. This is not surprising, since the poly- 
nomial F(x, y) was invented precisely to make the last equation true. Check- 
ing, we get 


2 
Fix fix)d a a Cre a 2 7x3 aHiy —x — 1 


I 
pe 
tO 
Ow 
| 
-_ 
— 
| 
| 
| 
Se 
| 
ios 


ee ee ee 


The function g can be shown to be algebraic by letting y = x3 + 2. 
Squaring both sides, we obtain y? = x*°+ 2, which is equivalent to 
y> — x* — 2 = 0. Hence if we define the polynomial F by the equation 


F(x, y) “ar y° a 4 ne eas 
then F(x, g(x)) = 0. Checking, we have 


F(x, g(x) = 0/98 + 27 x SH 2 
aig @ od gf esltnieayy 
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A rational function is one which can be expressed as the ratio of two 
polynomials. That is, a function f of one variable is rational if there exist 


polynomials p and gq of one variable such that f(x) = via . The technique 
q 


2 


| 
used in Example 1 to show that the function : ai 1 
= 


applied to any rational function of one variable. Thus we have the theorem 


is algebraic can be 


(5.1) Any rational function f of one variable is algebraic. 


Proof. Since f is rational, there exist polynomials p and q such that f(x) = ee Co 


g(x) 
Letting y = my we obtain yq(x) = p(x), which is equivalent to 


q(x)” 
yq(x) — p(x) = 0. The function F defined by 


F(x, y) = yq(x) — p() 
P(x) 


is a polynomial in x and y. Substituting f(x) for y, and then —— for f(x), we 
obtain q(x) 


F(x, fx) = fa) — p@) 


_ px) 7 
q(x) — p(x) 


P(x) — p(x) = 0, 


which completes the proof. 


The function g defined by g(x) = \/x* + 2 is an example of an algebraic 
function which is not rational. (A simple proof of this fact is suggested in 
Problem 2.) Thus the set of all rational functions of one variable 1s a proper 
subset of the larger set of all algebraic functions of one variable. 

It is by no means obvious that transcendental functions exist. However, 
we have actually encountered quite a few such functions already. Although a 
proof of the next theorem is too advanced to give in this book, it is important 
to know that it is true. 


(5.2) The following functions are transcendental: 


G) In, 
(ii) e”, 
(ii) a”, foranya > 0,a = |, 
(iv). log, x, for anya: > 0, a1, 
(vy). Sift X, COS X, fan %, COL %, sec x, csc x, 
(vi) -aresin.x, drccos x,. arctan x, arccot x, arcsec xX, arcesc x. 
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Another theorem [not so deep as (5.2), but still beyond the scope of this 
book] states that if fis an algebraic function, then the derivative f’ is also 
algebraic. However, the converse is false. In particular, we know that 


d 
A 
x 


dx 


i= ; a 
and — is not only algebraic, but also rational. In addition, the formulas in 
x 


Section 4 for the derivatives of the inverse trigonometric functions show that 
every one of these six transcendental functions has a derivative which is 
algebraic. 


PROBLEMS 


1. In each of the following examples identify the function fas a polynomial or not. 
If it is not a polynomial, give a reason. (Consider such things as the vanishing 
of higher-order derivatives, or the behavior of f, or of some derivative f™, 
near a point of discontinuity.) 


(a) fle) = = (©) fe) =Ve=2 
(b) fx) = = SS () fx) = xG°- 7) 
(c) f(x) = 1x +ex+2 ee far=< 

(d) fod = x" +x" (h) f(x) = tanx. 


2. Prove that the algebraic function g defined by g(x) = Vx? + 2 is not rational. 
[Hint: Suppose it is rational. Then there exist polynomials » and g such that 


‘/ x8 + 2 = a every x > —v/2. But then 
q(x 
2 
ee paca 
lakes q(x) 


or, equivalently, 
x? + 2)[gO)P — [Ip@)P = 0, forall x > —V2. 


The left side of this equation is a polynomial which is not identically zero. 
(Why?) How many roots can such a polynomial have ?] 

3. Prove that each of the following function f is algebraic by exhibiting a poly- 
nomial F(x, y) and showing that F(x, f(x)) = 0. 


Gy Fes a ee 
x — | dx 
(db) fix) = = arctan x (e) faye ins 
dx 
ic} i =  aresin x (iy Aix) = ok \/4x? — 1. 
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6. Complex Numbers. Since the square of a real number is never negative, 
the equation x” = —1 has no solution in the set R of all real numbers. 
However, we shall show that R can be considered as a subset of a larger set C 
which has the following properties: (1) The sum and product of any two 
elements in C are defined, and addition and multiplication obey the ordinary 
laws of algebra. (ii) There is an element 7 in C such that 7? = —1. (iti) Every 
element in C can be written in the form x + iy, where x and y are real num- 
bers. The elements of the set C are called complex numbers. Let us assume, 
for the moment, that the existence of C, obeying the three properties, has 
already been demonstrated. Then the sum of two complex numbers x; + 1)1 
and x. + iys is given by 


(x1 + iy1) + (2 + ie) = C1 + X2) + ii + ye). CP) 
For the product, we have 


(x, 33 1V1)(X2 + iyo) Xkas- 1X2 + pox = Pyiyy 


X1Xq + i(Xiy2 + Xoy1) + Piro. 


However, since > = —1, we get 
(x1 + iyi(%2 + ie) = (1X2 — Yiye) + iXiy2 + X21). (2) 
For example, 


Q+jNj+6-0=7- 27, 
(2 + i)(5 — 13) = 10+ i5 — 16 — 73 
= 10 — 7 ~.{—})3 
= iS 7, 


We turn now to the task of showing that there is a set C having the prop- 
erties listed in (i), (ii), and (iii). We shall take for C the set R° of all ordered 
pairs of real numbers, i.e., the xy-plane. Thus a complex number is by def- 
inition an ordered pair (x, y) of real numbers. Up to this point we have not 
ascribed any algebraic structure to the xy-plane, and so we must define what 
we mean by addition and multiplication of ordered pairs of real numbers. 
Later in this section we shall show how to express the ordered pair (x, y) 
in the traditional form x + iy. Anticipating this, however, we use equations 
(1) and (2) to motivate the definitions of the sum and product of ordered 
pairs. We define 


(X15 Va) aa Vo) = C+ Xa dvi+ yo), (3) 
(X1, Vi)(X2, V2) = (KX. — Vipr, Xiy2 + X21). (4) 


It was stated that addition and multiplication of complex numbers are to 
obey the ordinary laws of algebra. By this we mean that the following six 
propositions are true. The basic algebraic properties which they describe 
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are the same as those for the real numbers, and these six statements should be 
compared with the corresponding list on page 2. Abbreviating (x1, y,), 
(X2, 2), and (Xs, y3) by 21, Z2, and Z3, respectively, we have 


(6.1) ASSOCIATIVE LAWS. 
Zi + (22 + 23) = (21 + 22) + Z3, —-Z1(ZoZ3) = (2122)Z3. 
(6.2) COMMUTATIVE LAWS. 
AL 22 = 2a Sy Z1Z2 = 2921. 
(6.3) DISTRIBUTIVE LAW. 
(21 + Z2)Z3 = 2123 + ZoZ3. 


(6.4) EXISTENCE OF IDENTITIES. The two complex numbers 0’ = (0, 0) 
and 1’ = (1,0) have the properties that 0’ + z = z and 1’z = z for every 
gin ©. . 


(6.5) EXISTENCE OF SUBTRACTION. For every complex number z = (x, y), 
the complex number (—x, —y) is denoted by —z and has the property that 
z+ (—z) = O’. [The expression z; — Z, is an abbreviation for z; + (—Z2).] 


(6.6) EXISTENCE OF DIVISION. For every complex number z = (x, y) 


Xx ay ° —— 
: is denoted by z~} 
Pty et =) ‘ 


| ee ’ 
or — and has the property that zz~' = 1’. | The expression ~ is an abbrevia- 
2 22 


different from 0’, the complex number ( 


tion for og, 


Proof. The proofs are simple exercises using the definitions and the algebraic 
properties of real numbers. We give the proofs of (6.4) and (6.6) and leave 
the others for the reader to supply. It is asserted in (6.4) that the complex 
number (0, 0), which is abbreviated 0’ (and later simply as 0), is an additive 
identity. Letting z = (x, y), we have 


0'+ z= (0,0) + @ y) = 0+ %,0+ y) = &, y) = z, 


which proves the assertion. Similarly, for the multiplicative identity 
1’ = (1, 0) (later to be abbreviated simply by 1), we obtain 


I’z = (1, O)(x, y) = (lx — Oy, ly + x0) = (x, y) = z, 


and the proof of (6.4) is complete. 
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To prove (6.6), let z = (x,y) be a complex number different from 
0’ = (0,0). It follows that x? + y? is positive. Hence the ordered pair 


x —y ; rene 
; is defined, and (in anticipation of the proof) is denoted 
(3 ey (he Ge > : ad 

z_!. Multiplying, we get 


zz = (x,y) ce 


AP aE VE ok ge wr 


xX 


2 y 
= 
- (S45 ee a a5) 


x+y 
= (S24. Of = 04.0) 


which completes the proof of (6.6). 


Assuming that the remaining four propositions have been proved, we 
have now satisfied requirement (i) in the first paragraph of the section for the 
set C of complex numbers: Addition and multiplication are defined and obey 
the ordinary laws of algebra. But what about the prior assumption that the 
set R of all real numbers can be considered a subset of C? Of course, it is not 
actually a subset, since no real number is also an ordered pair of real numbers. 
However, there is a subset of C which has all the properties of R. This subset 
is the x-axis, the set of all complex numbers whose second coordinate is zero. 
Speaking informally, we shall identify R with the x-axis in C by identifying 
an arbitrary real number x with the complex number (x, 0). Proceeding 
formally, we define a function whose value for each real number x is denoted 
by x’ and defined by x’ = (x, 0). This function sets up a one-to-one corre- 
spondence between the set R and the x-axis in C. Essential, however, is the 
fact that this correspondence preserves the algebraic operations of addition 
and multiplication. To show that this is so, let x; and x2 be any two real 
numbers. Then, for addition, 


xi + X2 = (%1, 0) + (%2, 0) = (%1 + X2,0) = C1 + 2)’, 
and for multiplication, 


wx) = Cy, OO, 0) = site 0+0, x 0+ 2-0) 


= (8 yxa Oe xa) 


It follows that the algebraic properties of the set R of all real numbers are 
identical with those of the x-axis in C. It is therefore legitimate to make the 
identification, and henceforth we shall denote x’ simply by x. Note that in so 
doing, the additive identity 0’ and the multiplicative identity 1’, referred to in 
(6.4), become simply 0 and 1, respectively. 
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We now define 7 to be the complex number (0, 1). Requirement (ii) in 
the first paragraph of this section is easily seen to be satisfied: 


2? = (0, D0, 1) = 0:0 —1-1,0-14+ 0:1) 
=({-19 = == 


Since we have agreed to write (1, 0) = 1’ = 1, we therefore obtain the famous 
equation 


(6.7 ) P= =|, 
Requirement (i11) is 
(6.8) If z = (x, y) is an arbitrary complex number, then z = x + iy. 


Proof. The expression x + iy is an abbreviation for the more formal x’ + iy’. 
Hence 


x + iy = (x, 0) + @, 1)U, 9) 
= (x,0) + (0 ea ey sh) 
(x, 0) + (0, y) = (x, y) 


= his 


completing the proof, and also our construction of the set C of complex 
numbers. 


EXAMPLE 1. H z, = 4+ 73, z = 4 — 13, and z3; = 7 — i2, then find 
Zi + Ze, 21Z2, 3Z; — 2Z3, and z,z3. These are simply routine exercises in- 
volving the addition, subtraction, and multiplication of complex numbers. 


2 +e = 4 2B) Ee = ES, 
in = 4+ Oe — 6) = 164 1 
Dg (2 1. 
9 = Or = 3 + B= OS 
= bat .— ie = 2 +, 
ses Ay PS pre Pr Sie 
28 — (—D6--huld = 34 als. 
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EXAMPLE 2. If z; = 2 + 73 and z,. = 3 — i, plot 2, Z., and z,; + z. on the 
complex plane. We have 


fate Zs (2+) 443-5 A) = Sork 2, 


and the three points are shown in Figure 19. 


y-axis 


Sess $43 


Figure 19 


A complex number is a point in the xy-plane and can be indicated in 
a picture by a dot. Another useful geometric representation of z is an arrow 
with its tail at the origin and its head at the point z. We have drawn these 
arrows in Figure 19. Note that if P is the parallelogram whose adjacent sides 
are the arrows representing Zz, and Z», then the diagonal of P which has the 
origin as an endpoint is the arrow representing the sum z, + Z». The definition 
of addition in C implies that this parallelogram principle is valid for every 
pair of complex numbers. It provides a good method for adding complex 
numbers geometrically. 

When the complex number / was first introduced in mathematics, it was 
regarded as highly mysterious and was called an imaginary number, and this 
terminology has survived. If z = x + iy is an arbitrary complex number, 
then by definition x is the real part of z, and y is the imaginary part of z. 
Note that the imaginary part of a complex number is a real number. A 
complex number whose real part is zero, i.e., one that lies on the y-axis, is 
called pure imaginary. It is important to remember that 


(6.9) Two complex numbers are equal if and only if their real parts are equal 
and their imaginary parts are equal. That is, x; + iy, = X2 + ips if and only if 
X1; = Xo and y = Vos 
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Proof. We know that x1 + iy1 = (%1, y1) and x2 + ive = (Xe, y2). But two ordered 
pairs are equal if and only if their first coordinates are equal and their second 
coordinates are equal, and so (6.9) is proved. Another proof, which uses only 
the algebraic properties of complex numbers rather than their explicit con- 
struction as ordered pairs, is the following. Suppose that x; + iy; = 
Xo + iyo. Then x; — x2 = i(y2 — yx), and squaring both sides we obtain 


(x1 — X2)? = P(e — yi)? = —(o — yx)”. 


The left side is a nonnegative real number, and the right side is a nonpositive 
real number. They can be equal only if both are zero. Hence x; = x2 and 
yo = yy. The converse proposition is, of course, trivial. 


The absolute value, or modulus, of a complex number z = x + iy is a 
nonnegative real number denoted by |z| and defined by 


lz] = Vx + y. 


The geometric significance of |z| is that it is the distance between the point z 
and the origin in the complex plane. If z is represented by an arrow, then |z| 
is the length of the arrow. Note that if z is real, 1.e., if its imaginary part is 
equal to zero, then the absolute value of z is simply its absolute value as a real 
number. Thus, if z = x + iy and y = 0, then 


le] = V8 + He = |e. 


An important property of the absolute value is the following. 


(6.10) The absolute value of the product of two complex numbers is the product 
of their absolute values; 1.e., |21Z2| = |Z1| [22]. 


Proof. Let z; = x; + iy; and zo = x2 + iyo. Then we have 212. = x1x2 — 
yiv2 + i(x1y2 + X2y1). Hence, by the definition of absolute value, 


|Z1Z2|? = (xix2 — yiye)? + (Cxiye + x21). 
Simplifying, we get 


|Z1Z2|?> = x12x_? — Axixeyiye + yr?yeo® + xiy0? + Aixeyiye + x2yP 
= X1°(x2? + yo”) + yi? (x2? + yo”) 
= (x1? + yi?)(x2? + yo”) 
= |z1)? |ze|?. 


Thus |z1z2|? = |z,|* |zo|?, and the proof is completed by taking the positive 
square root of each side of the equation. 
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As an illustration of Theorem (6.10), consider the complex numbers z, 
and z. shown in Figure 19. The product z,z. is equal to 


Zo = (2+03)3 —) =64+ 129 — 72 — 723 
= 9+ j7. 

The absolute values are 

ae = ye a 8? 4/7 1, 

ize] = V3? + (—12 = V'10, 

[ZZe) = \/92 + 72 = +/130, 
which is in agreement with (6.10). 
If z = x + iy, then the complex conjugate of z, denoted by Z, is defined 


to be the complex number 
2 = eo ay, 


ty 


The product of a complex number and its complex conjugate is always a 
nonnegative real number, since 
zz= (x + iypl(x -—iy=x+4+ y’. 


Since x? + y® = |z/?, we obtain the formula 


The complex conjugate is a useful tool for computing the real and 
imaginary parts of the quotient of two complex numbers. If z; and Ze are 


given and if z. ¥ 0, then 
a4 i aN Zo “a Z122 


me ae Z|? 


and the denominator of the right side is a real number. 


EXAMPLE 3. Compute the real and imaginary parts of the complex number 
yar a3 
7 — i20 


The complex conjugate of 7 — i2 is the number 7 + i2. Hence 


Ta aT 2) 


7 ea a Fe 
Since (5 + 737 + i2) = 35 — 6+ i21 + i10 = 29 4+ 331, we obtain 
5+ 173 29+ 231 


7—i2 53 


a <ol 
Baggs bhai 


Thus the real part is 23, and the imaginary part is 24. 
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PROBLEMS 


. Prove Propositions (6.1), (6.2), (6.3), and (6.5). 
. Perform each of the indicated operations and write the answer in the form 


x + iy. 
@ 64440 6) a 
4 — i3 
= ee € § ; 
b) (-2+iV2)+(-2-W2) ) SOS 
© (tine =O 
3a — i3b 
3 —i7 —2—-—i 
aie Oo = s 
(e) (a+ ib)(a — i2b) (k) pe 
3 — i4 
(f) 2(4 — i3) + 7(—2 + 15) dad) (2+ i7)Q2 — 15). 
. Letz, = 2+ i3,z. = —1 — i,and z3 = i. Plot each of the following complex 
numbers in the complex plane. 
(a) 23 (d) 2123 
(b) Ze (e) 2, — 2Z 
(c) z+ 2 = 
22 


. Find the complex conjugate of each of the following complex numbers. 


(a) 2 — 13 (e) 4(2 — 13) 
(b) 5+ i4 i een 
aj os ey a. 


(d) (2 — i3)5 + #4) 


. (a) In Problem 4, compute the sum of conjugates formed in (a) and (b), 


and compare with the conjugate of the sum in (c). 


(b) In Problem 4, compute the product of the conjugates found in (a) and 
(b), and compare with the conjugate of the product found in (d). 


(c) In Problem 4, multiply the conjugate found in (a) by 4, and compare with 
the answer found in (e). 


. For any complex numbers z; and Zs, prove that 


(a) Zi + Zo = 2, + Be ss AD, Za ee oO). = he 


. If ais real and positive and z complex, prove that |az| = alz]. 
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8. (a) Prove that the sum and product of two complex numbers which are 
conjugates of each other are real. 


(b) Prove that the difference of two complex numbers which are conjugates of 
each other is pure imaginary. 


9. Graph all points z satisfying 


(ay fl = 2 () 2<|z| <4 

(b) \z| = 2 (ff) |z— 2| 2 

)-4z| > 1 (g) |z — zo| = 3, fora fixed zo 
ies 2 wets es Se 


10. Given two complex numbers z; and Zs, plot them and give a geometric inter- 
pretation of |z; — zo]. 
11. (a) Show that —1 has two square roots in C. 


(b) Show that every nonzero real number has two square roots: 


7. The Complex Exponential Function e*. Consider the function ¢ defined 
es g(x) = cos x + isin x, 

for every real number x. This is a complex-valued function of a real variable. 
The domain of ¢ is the set R of all real numbers. For every real number x, 
we have A ee 


It follows that v(x) is a point on the unit circle in the complex plane, 1.e., the 
circle with center at the origin and radius 1. Conversely, every point on the 
unit circle is equal to (cos x, sin x), for some real number x, and we know 
that (cos x, sin x) = cosx + isin x. It follows that the range of ¢ 1s the 
unit circle. 

The function ¢ has the following properties: 


(7.1 ) gt= F 
(7.2 ) g(a)y(b) = g(a + Db). 
g(a) _ 
l 
(7.4 ) y(—a) = re ; 


Proof. The proofs are completely straightforward. Thus (7.1) follows from the 
equations 
y(0) = cos0O + isinO = cosO = 1. 


To prove (7.2), we write 


y(a)ye(b) = (cosa + isina)(cos b + isin db) 
cosacos 6 — sina sind + i(sina cos b + cosa sin b). 
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The trigonometric identities for the cosine and sine of the sum of two numbers 
then imply that 

g(a)g(b) = cos(a + b) + isin(a + 5B), 
and the right side is by definition equal to g(a + 5). Thus (7.2) is proved. 
As a special case of (7.2), we have 


g(a — b)g(b) = g(a — b + b) = (a). 


On dividing by y(4), which is never zero, we get (7.3). The last result, (7.4), 
is obtained by taking a = 0 in (7.3) and then substituting 1 for g(0) in ac- 
cordance with (7.1). Thus 


oT = Bt By. 
aa mp oe 


The above four properties of ¢ are also shared by the real-valued ex- 
ponential function exp [we recall that exp(x) = e*]. This fact suggests the 
possibility of extending the domain and range of exp into the complex plane. 
That is, it suggests that the functions yg and exp can be combined to give a 
complex-valued exponential function of a complex variable which will have 
the property that when its domain is restricted to the real numbers, it is 
simply exp. We define such a function now. For every complex number 
z = x + iy, let Exp be the function defined by 


Exp(z) = exp(x)¢(Q). 
Thus 
Exp(z) = e*(cos y + isin y). 


Mizoeox 4+:40,<thetiegie xcand Exp) = exp@)e() = expi)) Meneeiie 
function Exp is an extension of the function exp. 

It is a routine matter to show that the function Exp has the exponential 
properties listed above for y. Following the practice for the real-valued 
exponential, we shall write Exp(z) as e*. In this notation therefore, if 
z= x + jy, the definition reads 


e® = e*(cosy + isin y). 


The exponential properties are 


(7.1’) e° = |. 

(12) e7 1972 — el 172, 
21 

C73) 2 ere 
e”? 

(7.4) Lane ca 
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Proof. The proofs simply use the fact that the functions exp and ¢ separately have 
these properties. Thus 


Swe = exmpije@) = ‘}-1 = 1, 
Letting z} = x; + iy; and z2 = x + iyo, we have 


exp(x1)o(Vi)exp(*2) ¢()2) 
exp(x1 + X2)¢()1 + 2). 


e*1e72 


Since Zz; + Zz. = (x1 + x2) + i(y1 + yo), the right side is by definition equal 
to Exp(z; + zs), which is e%1*%2, The last two propositions, (7.3’) and 
(7.4’), are corollaries of (7.1’) and (7.2’) in exactly the same way that (7.3) 
and (7.4) follow from (7.1) and (7.2). 

If x is an arbitrary real number, then 


ere eo = e(cos x +i sin x). : 


Thus we have the equation 


e’* = cosx + isin x, for every 


a) real number x. 


Thus if x is any real number, the complex number e”” is the ordered pair 
(cos x, sin x). Hence e” is the point on the unit circle obtained by starting 


imaginary axis 


a eix= cosx+isin x 


real axis 


Figure 20 


at the complex number | and measuring along the circle at a distance equal 
to the absolute value of x, measuring in the counterclockwise direction if x 
is positive and in the clockwise direction if it is negative (see Figure 20). In 
terms of angle, x is the radian measure of the angle whose initial side is the 
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positive half of the real axis and whose terminal side contains the arrow 
representing Z. 

Letting x = 7 in (7.5), we get e’” = cosa + isin. Since cos7 = —1 
and sin 7 = 0, it follows that e’" = —1, which is equivalent to the equation 


This equation is most famous since it combines in a simple formula the three 
special numbers z, e, and 7 with the additive and multiplicative identities 
0 and 1. 

One of the most important features of the complex exponential function 
is that it provides an alternative way of writing complex numbers. We have 


(7.6) Every complex number z can be written in the form z = |z\e"', for some 
real number t. Furthermore, if z = x + iy and z # 0, then z = |z\e” if and 


xX ; 
only if cos t = — and sint = ase 
Iz I2| 


Proof. If z = 0, then |z| = 0, and so 0 = z = |zle*’, for every real number ¢. 


Next we suppose that z ~ 0. Then |z| # 0, and ia is defined and lies on the 
vis 


unit circle because 


; Zz 

Hence there exists a real number f¢ such that iz] = e"! and this proves the 
Z 

first statement in the theorem. Suppose that z = x + iy and that z ¥ 0. 

if 2. ="|zie", then 


x + iy = |zle* = |z|\(cost + isin). 


Two complex numbers are equal if and only if their real parts are equal and 
their imaginary parts are equal. Hence x = |z| cost andy = |z| sin r. Since 


\z| # 0, we conclude that cos t = i and sin t = F . Conversely, if we start 
PA z 


from the last two equations, it follows that 

x + iy = |z\(cost + isin #). 
The left side is equal to z, and the right side to |z|e’’.. This completes the 
proof of the theorem. 


A complex number written as z = |z\e’’ is said to be in exponential 
form. The number |z| is, of course, the absolute value of z, and the number f 
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is called the angle, or argument, of z. The latter is not uniquely determined by 
z. Since the trigonometric functions sin and cos have period 27, it follows 
that 
= ae 
for every integer n. 

Consider two complex numbers written in exponential form: 


its 


Z71, = Izi|e*” and 22> |Zs|e 
The product and ratio are given by 
re it) it oa [Z| oe 
Z1Z2 = |2;| |Z2|e’1e"?, a Pre pita 


Hence by formulas (7.2’) and (7.3’) for the product and ratio of exponentials, 
we have 


i(ty +12) 41 1a, 


225 = /Z;| FAG : re S = a(ty—tg), 


€ 


That is, two complex numbers are multiplied by multiplying their absolute 
values and adding their angles. They are divided by dividing their absolute values 
and subtracting their angles. 


41. 
EXAMPLE |. Let z; = 3 + /4 and z. = —2j. Express 21, 2, 21Z2, and — in 
Z2 


the exponential form |z\e’’, and plot the resulting arrows in the complex 
plane. To begin with, 


[Z| = \/32 —+ “= = 
zo) = /0? + (—2Y = 2. 


We next seek a real number ¢; such that cos ft; = 2 and sin ft; = 4, and also 
a number fy such that cos ¢; = O and sin tp = —1. These are given by 


ty = arccos 2 = 0.93 (approximately), 


te = arcsin(—l1) = — —- 
2 
Then 
21> lzije*” = 5 aaa 


Zo = |zole"® = Qe?!) 
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Since #; + ft. = 0.93 — ; = —0.64 (approximately) and ft; — ft = 0.93 — 
(- *) = 2.50 (approximately), we obtain 


i(ty+te) 10e7 200-84) 
b) 


ZiZ2 = |z:| |Zsle 
oa \Z1| sO > i250) 
22 |Z9| 


: 21 ‘ ; 
The arrows representing Zi, Z2, 21Z2, and — are shown in Figure 21. To 
Z2 


imaginary axis 


real axis 


= 10 gis 


Figure 21 


locate these numbers geometrically using a ruler and protractor marked off in 
degrees, we would compute 


f= O93 radian: — 53 dearess, 
bQ= = 5 tadians = —90 degrees, 


ti + te = —0.64 radian = —37 degrees, 
ti — t, = 2.50 radians = 143 degrees. 
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Of course, we can find the real and imaginary parts of z,z, and ae by the 
computations a 
ZZ. = (3 + 14)(—2i) = 8 — 66, 
Meee. 3 2 8 Ht IO 


cs ae sa ee a a ae peal 2 eee oem 3. 
a r ee 


If z is a complex number, then z” can be defined inductively, for every 
nonnegative integer n, by 


rage a ee 
Se), forn > 0. (24 


Another useful property of the complex exponential function is 
CET} (e7)” = e”*, — for every nonnegative integer n. 


Proof. By induction. If n = 0, then (e*7)” = (e)°. Since e? is a complex number, 
(e7)® = 1, by equation (1). Moreover, in this case, e”* = e* = 1, by (7.1’). 
Next suppose that n > 0. By equation (2), we have (e*)” = e*(e*)"—1, and 
by hypothesis of induction (e7)”—! = e—*, Hence, by (7.2’), 


(e*)” eke eze(r—De eae ge 


and, since z + (n — 1)z = nz, the proof is finished. 


Let z be a complex number and 7 a positive integer. A complex number 
w is said to be an nth root of z if w” = z. We shall now show that 


(7.8) If z # 0, then there exist n distinct nth roots of z. 


Proof. Let us write z in exponential form: z = |z\e’*. By |z|!/" we mean the positive 
nth root of the real number |z| (which we assume exists and is unique). Con- 
sider the complex number 

em Pe la Se a 


It is easy to see that wp» is an nth root of z, since 


Wo” we (a ye” Vz 


Iz|e" 
ae 


However, wo is not the only nth root. We have already observed that 


Lt \(t-+2rk 
z= |zle" = |z\e“t? , 
for every integer k. If we set 
1/n geek 
w= ie ee 
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then all these numbers are seen to be nth roots of z, since each one satisfies 
the equation w,,” = z. However, they are not all different. Note that w;+1 1s 


; " ees Z 
equal to the product w,e*?7/”, The angle of e’?"/” is — radians, and sl 
n n 


is one nth the entire circumference of the unit circle. Thus w;41 is obtained 
20 ; : 
from w; by adding an angle of — radians, or, equivalently, by rotating w;, 
n 
1 ; — 
exactly — of an entire rotation. If we begin with w; and form wz41, Wr+2,.-- 
n 


by successive rotations, when we get to w;z4n we will be back at w;,, where 
we started. Thus there are only n distinct complex numbers among all the 
w’s. In particular, 


are n distinct nth roots of z. This completes the proof. 


An nth root of z is a solution of the complex polynomial equation 
— z = 0. It is a well-known theorem of algebra that a polynomial equa- 


tion of degree n cannot have more than n roots. Hence we can strengthen the 
statement of (7.8) to read that every nonzero complex number z has precisely 
n distinct nth roots. 


imaginary axis 


z=1+i 


real axis 


Figure 22 


EXAMPLE 2. Find the three cube roots of the complex number z = | + J, 
and plot them in the complex plane. Writing z in exponential form, we have 


g=*/te"" 


(see Figure 22). Hence the three cube roots are 


cad wl4+2rk 
we (dy eS oh ae SD 
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= l 
Since (\/2)!/? = 2/6 = 1.12 (approximately) and 5 (*) radius = 15 degrees, 


we see that w9 is the complex number lying on the circle of radius 1.12 about 
the origin and making an angle of 15 degrees with the positive x-axis. The 
other two roots lie on the same circle and have angles of 15 + 120 degrees 
and 15 + 240 degrees, respectively. The three roots are thus V2 e%\7/”, 
av) git, and /2 et (llr /12)_ 


PROBLEMS 


1. Write each of the following complex numbers in the exponential form |z|e“. 


(a) 1+i (ey <5 
i t—i (f) e", where z = 2+ 17 
() 1+ 1/3 (g) 5 , 
eet /3 | 2: (h) i. 


2. Let z} = V3 +i and z=1-—#i. Write each of the following complex 
numbers in the exponential form |z|e and plot it in the complex plane. 


21 


(a) e” (d) i 
(b) e? (e) 2e*! 
(c) ete (f) -\fe1)*. 
3. Find the real and imaginary parts of each of the following complex numbers. 
(a) al (d) aft a ae 
oy 3°" (e) »+/34e”, where ¢ = arcsin 
34 
in~ 1%(1/4) 42 2 ; 2 
ie) 2:28 (f) +/13e°, where sint = — ——- 


V/13 
4. If z; = |z,\e’‘, what is the exponential form of its complex conjugate Z,? 
5. Derive (7.3’) and (7.4’) using (7.1’) and (7.2’). 
6. Let 2 be a positive integer, and let z” be defined as in the text. If z ¥ 0, define 
1 


an 
“a 


Theorem (7.7) holds for all integers. 
7. Find and plot the nth roots of z in each of the following cases. 


in w= 3 andz = 3 (d) n= 4andz=1 
me a= Zand z = i (eo) n = Sand z = 2i 
i) 27 = send 2 = 2 (f) n=3andz=1+ivV3. 


ek 


, and thence show that (e7)—" = e—”*.. Asa result, we know that 
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8. How would you define the function 57? 
9. (a) Using the equation e* = cos x + isin x and the fact that (e”)” = e’”, 
prove that 
cosnx + isinnx = (cos x + isin x)”. 


This is known as de Moivre’s Formula. 


(b) Using de Moivre’s Formula and the Binomial Theorem, find trigonometric 
identities for cos 3x and sin 3x in terms of cos x and sin x. 


10. Every complex-valued function f of a real variable determines two real-valued 
functions f, and /2 of a real variable defined by 


fi(x) = real part of f(x), 
fo(x) = imaginary part of f(x). 


Thus f(x) = fi(x) + ifo(x) for every x in the domain of f. We define the 
derivative f’ by the formula 


f'(x) = fix) + ife(e). 
Applying this definition to the function f(x) = e”, show that 


ad a = Pia 
dx : 


8. Differential Equations. In this section we shall show how to obtain the 
general solution of any differential equation of the form 


ae. 


where a and 6 are real constants. Differential equations of this type occur 
frequently in mechanics and also in the theory of electric circuits. Equa- 
tion (1) is a second-order differential equation, since it contains the second 


eee ae, 
derivative = but no higher derivative. It is called linear because each one of 
x 


d a’ ee d 
y, a , and =s occurs, if at all, to the first power. That is, if we set = Z 
dx x dx 


@? 
and a — w, then (1) becomes w + az + by = 0, and the left side is a 


linear polynomial, or polynomial of first degree, in w, z, and y. A second- 
order linear differential equation more general than (1) is 


os + f(x) + g(x)y = h(x), 
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where f, g, and /are given functions of x. Equation (1) is a special case, called 
homogeneous, because /: is the zero function, and said to have constant co- 
efficients, since f and g are constant functions. Thus the topic of this section 
becomes: the study of second-order, linear, homogeneous differential equa- 
tions with constant coefficients. 

An important and easily proved property of differential equations of this 
kind is the following: 


(8.1) Jf y; and y2 are any two solutions of the differential equation (1), and if 
c, and cz are any two real numbers, then cyy, + Cope is also a solution. 


Proof. The proof uses only the elementary properties of the derivative. We know 


that 
< (ciyi + Coy) = oO + eee 
Hence 
Ls (Ciy1 + C2y2) = £ (« os ae #) 


To test whether or not ci; + coy is a solution, we substitute it for y in the 
differential equation: 


9 


do d 
— (ciyi + Coy2) + a= (ery + C2y2) + b(ciy1 + C2y2) 
dx? dx 


ee ay, 
Cl de C2 de acy ae ace We C1y1 C2y2 


d’ d d’ d 
C1 4 gt + by:) +0 se ld ae ‘ 
dx dx? 


dx? 


The expressions in parentheses in the last line are both zero because ); and y2 
are by assumption solutions of the differential equation. Hence the top line 
is also zero, and so cy; + C22 is a solution. This completes the proof. 


It follows in particular that the sum and difference of any two solutions 
of (1) is a solution, and also that any constant multiple of a solution is again 
a solution. Finally, note that the constant function 0 is a solution of (1) for 
any constants a and 5b. 

In Section 5 of Chapter 5 we found that the general solution of the 


—kz 


| oe 
differential equation = + ky = Ois the function y = ce “*, where c is an 
x 


arbitrary real number. This differential equation is first-order, linear, homo- 
geneous, and with constant coefficients. Let us see whether by any chance an 
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Pipa paseo might also be a solution of the second-order differential 
equation © = ari »@ oe cy-= 0, Let.» =. 2, where 7 is any teal pum. 


Then 


Hence 


d’y 3 15 ee rx rx 
pee o + by = re + are’ + be 


(r” + ar 2 Oe". 


Since e”” is never zero, the right side is zero if and only if 7? + ar + 5 = 0. 
That is, we have shown that 


(8.2) The function e" isa solution o oe by = 0 if and only if the 


real number r is a solution of t®? + at re b: = i 


The latter equation is called the characteristic equation of the differential 
equation. 


ae d 
EXAMPLE |. Consider the differential equation a — = — 6y = 0... Its 
x x 
characteristic equation is ? — t — 6 = 0. Since ? — t— 6 = (¢ — 3)@ + 2), 
the two solutions, or roots, are 3 and —2. Hence, by (8.2), both functions 
e®* and e *” are solutions of the differential equation. It follows by (8.1) that 


the function 


ae 


y = ce” + cre 


is a solution for any two real numbers c, and co. 


The form of the general solution of the differential equation 


id ory Me Ja PEG 


depends on the roots of the characteristic equation fr? + at + b = 0. There 
are three different cases to be considered. 


Case 1. The characteristic equation has distinct real roots. This is the 
simplest case. We have 


P+at+b= (t — ryt — ro), 
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where 7, and r: are real numbers and 7; # rs. Both functions e”!” and e’2” 
are solutions of the differential equation, and so is any linear combination 
cye"1* + ec ,e’2*. Moreover it can be shown, although we defer the proof until 
Chapter 11, that if y is any solution of the differential equation, then 


for some two real numbers c; and cy. Hence we say that (2) is the general 
solution. In Example 1 the function c,e*” + c,e *” is therefore the general 
; Se d 
solution of the differential equation itd Sa 6y = 0. 
dx dx 


Case 2. The characteristic equation has complex roots. The roots of 
e+ at+ b = Oare given by the quadratic formula 


—@ ss wat — ab 
fy ts= ————_——: 
2 
Since a and bare real, r; and r, are complex if and only if a? — 46 < 0, which 
a \/4b6 — @? 
we now assume. Settinga = — 5 and 6 = lig eh we have 


n=act ig, fo = ae = 18. 


Note that 7, and r, are complex conjugates of each other. 

Motivated by the situation in Case 1, in which 7; and r, were real, we 
consider the complex-valued function c,e1* + c.e"2*, where we now allow 
c, and cz to be complex numbers. We shall show that 


(8.3) If c; and cy are any two complex conjugates of each other and if r; and rz 

are complex solutions of the characteristic equation, then the function 
Bee see 

is real-valued. Moreover, it is a solution of the differential equation (1). 


Proof. Let c, = VY + i6 and cp 
we have 


Y — i6. Since r, = a+ i8 and ro = a — if, 


on ste ese"?” a (Y re ibye*t*?* ae (Y a pen Ps 
ey + ide + Cy — ide. 


Recall that e” = cos 6x + isin Bx and e—'®* = cos(—6x) + isin(—Bx) = 
cos 8x — isin Bx. Substituting, we get 


oe” + Ge? =e WY + (cos. Bx + isin Gx) 
+ (Y — i6)(cos Bx — isin Bx)] 
= e* (27 cos Bx — 26 sin Bx). 
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The right side is certainly real-valued, and this proves the first statement of 
the theorem. Since Y and 6 are arbitrary real numbers, so are 2Y and —26. 
We may therefore replace 2Y by k; and —26 by ko. We prove the second 
statement of the theorem by showing that the function 


y = e*(k;, cos Bx + ke sin Bx) C3} 
is a solution of the differential equation. Let y, = e**cos8x and 
yo = e*sin Bx. Since y = kyy1; + koyo, it follows by (8.1) that it is enough 
to show that y; and y2 separately are solutions of the differential equation. 


We give the proof for y; and leave it to the reader to check it for yo. By the 
product rule, 


d ax ax aris 
Se (e cos Bx) = ae cos Bx — Be sin Bx 


= e“(acos Bx — B sin Bx). 


Hence 
fe = ae“ (a cos Bx — Bsin Bx) + e“(—aB sin Bx — B° cos Bx) 
= e“[(a’ — B°) cos Bx — 2a8 sin Bx]. 
Thus 
dys 


——+ a4 on + by, = e“[(a’ — 8°) cos Bx — 2a sin Bx] 
+ ae“"(a cos Bx — B sin Bx) + be*’cos Bx 
= e“([(a’ — 8”) + aa + b] cos Bx — B(2a + a) sin Bx). 


But, remembering that r, = a + i8 and re = a — if and that these are the 
roots of the characteristic equation, we read from the quadratic formula that 


eee ge 
ie Sty See se, 
Hence 
. Al, = 5 \* 
(aw — B)+aa+b= (-:) -(Y4e-#) ta(-2) +5 
Z 2 2 
hy ¥ 
Pepe es = ae > 
and also 
2a +a =2(—5)+a= —at+a=Q, 
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whence we get 


ay aie oe + by; = e(0- cos Bx — 8-0- sin Bx) 
ar dx 


= 0, 


and so y; isa solution. Assuming the analogous proof for ys, it follows that 
y, as defined by (3), is also a solution and the proof is complete. 


It can be shown, although again we defer the proof, that if y is any real 
solution to the differential equation (1), and if the roots 7; and r of the char- 
acteristic equation are complex, then 


for some complex number c, and its complex conjugate c.. Hence, if the roots 
are complex, the general solution of the differential equation can be written 
either as (4), or in the equivalent form, 


y = e*(k, cos 8x + ky sin Bx), (x) 


where 7; = a + i8 and rp = a — iB, and k, and ky are arbitrary real 
numbers. Note that solutions (2) and (4) look the same, even though they 
involve different kinds of r’s and different kinds of c’s. 


EXAMPLE 2. Find the general solution of the differential equation 


The characteristic equation is 2? + 4t + 13 = 0. Using the quadratic for- 
mula, we find the roots 

—44+/16—4-13  —44+ V/—36 

Fi; (EE 2 ew Sere anere aeiaiees aah Sa oe Pea he eaeenees 


2 2 
= 2 + 31. 


Hence, by (4), the general solution can be written 
y= cer Bis aaah 


where c; and c. are complex conjugates of each other. Unless otherwise 
stated, however, the solution should appear as an obviously real-valued 
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function. That is, it should be written without the use of complex numbers as 
in (5). Hence the preferred form of the general solution is 


y =e *(k, cos 3x + ky sin 3x). 


We now consider the remaining possibility. 


Case 3. The characteristic equation #? + at+ b= 0 has only one 
root r. In this case, we have ¢? + at + b = (t — r)(t — r), and the qua- 


dratic formula yields r = — ; and \/a? — 4b = 0. 
Theorem (8.2) is still valid, of course, and so one solution of the differ- 
ee d 
ential equation = + af + by = 0 is obtained by taking y = e”*. We 
x x 


shall show that, in the case of only one root, xe” is also a solution. Setting 
p= xe?) we obtain 


ss =e” + xre™ = e'(1 + rx), 
d’y rx bi 
inna the 225 Oe 
ex re’ (2 + rx). 
Hence 
d°y dy rx 72x 7x 
fa t 8g, + by = re" + rx) + ae"(l + rx) + bxe 


ar + x + a ere bx) 
= eIx(r’ + ar + 6b) + (a + 27). 
Since ris a root of #2 + at + b, we know that r? + ar + b = 0. Moreover, 


we have seen that r= — s and soa-+ 2r = 0. It follows that the last 


expression in the above equations is equal to zero, which shows that the 
function xe’” is a solution of the differential equation. 
Thus e” is one solution, and xe’” is another. It follows by (8.1) that, for 
any two real numbers c; and co, a solution is given by 
y = cixe"™ + coe” = (C1x + C2)e"”. 


Conversely, it can be shown that if y is any solution of the differential equa- 
tion (1), and if the characteristic equation has only one root r, then 
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for some pair of real numbers c; and cy. The general solution in the case of a 
single root is therefore given by (6). 


EXAMPLE 3. Find the general solution of the differential equation 9y"" — 
6y’ + y = 0. Here we have used the common notation y’ and y” for the 
first and second derivatives of the unknown function y. Dividing the equation 
by 9 to obtain a leading coefficient of 1, we get py” — 3y’ + gy = 0, for which 
the characteristic equation is *® —2f+4=0. Since & —2t+ 5 = 
(t — 4)(t — 4), there is only one root, r = 3. Hence 


x/3 


y = (c.x + en)e 


is the general solution. 


a 


The solution of a differential equation can be checked just as simply as 
an indefinite integral, by differentiation and substitution. 


PROBLEMS 


1. Find the general solution of each of the following differential equations. If 
the characteristic equation has complex roots, write your solution in trigono- 
metric form. 


d d 
(a) 2 + 3y = 0 (hy) =F +4y = 0 
dy 
(b) y’ = S5y (i) Fo ey ak 
dy dy a ale 
Ce ae y= 0 Dr pet slap 
d'y dy x dy dy _ 
Oe a a he 
ry 2 
(ec) y+ 8y' + 16y = 0 ee 
dx? ax 
2 
a ae (a) a a ae = 0 
dx? dx 
(g) y’ — Ty’ = 0 (n) y’ + 14y’ + 49y = 0. 
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2. (a) Find the general solution of the differential equation 


(b) Find the particular solution y of the equation in part (a) with the property 
d 
that y = 2 and = = 9 when x = 0. (Hint: Use these two conditions to 
a 


evaluate the arbitrary constants which appear in the general solution.) 
3. Find the particular solution y of each of the following differential equations 


such that y and 2 have the prescribed values when x = 0. 
a 


(a) ee = 2 hen x = 0 
ie ge i eee 
Pe 
(b) 3% +10 +y=0, y = land = “when x = 0. 
d’y dy dy 
(c) st y = 3and— = 6when x = 0. 
dy dy dy 


(e) y’+ 3y’+5y=0, yO) = 2and y’(0) = 6. 


4. Show by differentiation and substitution that e®*sin8x is a solution of 


2 
ov ag y= Oe ee 
dx’ dx 


5. (a) Show that e? = e?. 


(b) Use (a) and Problem 6, page 334, to show that ce? + Ce? is real for any 
complex numbers c and z. 


Vai + b? 
6. (a) Multiply acos x + dsinx by ee a and hence show that it can be 
a 
mee a 
written in the formc sin(x + k), where c = Va? + B2, sink = —= ; 
h Vai + be? 
and cos k = ————— 
a? b 
Va + b? 


(b) Multiply acosx + bsin x by and hence show that it can be 


Va? + b? 


written in the form c cos(x + k), where 


= Jah cos SS 


C2N/8 FP sink = 
Va? + b? Var + Bb? 
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iz 


10. 


(a) Show that y = cye**sin(Bx + ce) is a solution of the differential equation 


y” — day’ + @? + By = 


(b) Show that y = cye**cos(Bx + c2) is also a solution of the differential 
equation in (a). 


. For what value or values of r is e”” a solution of the third-order linear differ- 


ential equation y’’”’ — 6y’’ + 5y’ = 0? 


. (a) Solve the homogeneous differential equation 


d'y dy = 
Te 8 =e + 12y = 0. 
(b) Substitute the linear polynomial Ax + B for y in the nonhomogeneous 
differential equation 
d y dy 


js ss Se 


Hence find values of A and B for which this polynomial is a particular 
solution of the differential equation. 


(c) Show that the function which is the sum of the solutions found in (a) 
and (b) is also a solution to the one equation in (b). 


If, is a solution -of - +a —- by = 0 and y, 1s° a solution ‘of 
2y 

oe == @ a, by = h(x), show mee yr + Yp is also a solution of 

a 


+ aS + by = his) 
ax 


oPAPTER 7 


Techniques of Integration 


We have found indefinite integrals for many functions. Nevertheless, 
there are many seemingly simple functions for which we have as yet no 
method of integration. Among these, for example, are In x and \/a? — x?, 
since we have no way of finding [In x dx and [\/a? — x? dx. In this chapter 
we shall develop a number of techniques, each of which will enlarge the set of 
functions which we can integrate. 


1. Integration by Parts. One of the most powerful methods of integration 
comes from the formula for finding the derivative of the product of two 
functions. If uw and v are both differentiable functions of x, we recall that 


d dv du a 
— WU =.u-- +v—.. AS a consequence, of course; 1t “1S. hue that 
dx dx dx 
dv d du = 
u— = — uw — v—. From this last:equation it follows that 
: ee dx 


dv d du 
fu ax Bee | (dw — 0 dt) ax 


Since the integral of a difference is equal to the difference of the integrals, the 
last equation is equivalent to 


dv d du 
[ut ax = | Ly ax — [ota 


d ; 
But / - (uv) dx = uv + c. Hence, leaving the constant of integration as a 
Sax 


by-product of the last integral, we obtain 


(1.1 ) 


This is the formula for the technique of integration by parts. To use it, 
the function to be integrated must be factored into a product of two functions, 
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d 
one to be labeled uw and the other = . If we can recover v from “ , then we 
d d “ 
hope that | v = dx is easier to find than | ue dx. As we shall see, the 


trick is to find the right factorization. Sometimes it is obvious, and some- 
times it is not. 


EXAMPLE I. Integrate 
(a) | In x dx, (b) | x sin x dx. 


d 
For (a), consider the factorization Inx-1, and let u = Inx and —e f. 
d | , 
Then = = — and v = x (we normally do not concern ourselves with a 
<x 


constant of integration at this point—we look for some v, not the most general 
v). Integrating by parts, we have 


Jinx 1-ae = cin xyox) — J x-4 ar, 


a et — 
dv du 
u Ae jf ee © Vv ae 


Or 


[in xa =ixin x — [ax 


=xInx—x+e. 
If this is the correct answer, its derivative will be the integrand. Checking, 
Wwe get d ' 
ee ee ee ees 1+ 0 


Inx+1-—1=Inx. 


dv 
For (b) we have choices. We can try letting uv = x and = sin Xx, 
x 


dv 
dx 
and error shows that the first suggestion works and the others do not. If 


5 dv : 
Or we can try uv = sin x and — x, Or even u = x sin x and ee Page Seb 2 
x 


d d 
a = x and oe sin x, then we have — 1 and v = —cosx. The in- 
dx dx 


tegration-by-parts formula implies that 


iE sin x dx = (x)(—cos x) — | cos x1) dx, 
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Or 


| xsin x a = —xcosx + [cos x dv 


—xcosx+sinx+ ec. 


Although it is-not true that every function can be written as a product of 
functions which will lead to a simplification of integrals—this technique is of 


Ce dv = 
little use if | v c= dx is not easier to integrate than | u = dx—it is true that 
x x 


many can and that they can be integrated as were the two integrals in Ex- 
ample 1. The problem is to find the best function to call u. Sometimes the 
technique of integration by parts must be used more than once in a problem. 


EXAMPLE 2. Integrate [(3x? — 4x + 7)e**dx. If we let u = 3x2 — 4x +7 


d d, 
and spe e** then eg a 6x — 4andv = $e”. 
dx dx 


| Gx + 4c 4 De = Ge = 40> ee ; (te" 6x —4)dx 


= ae Ee | (3x — 2)e*dx. 


This last integral again involves a product of a polynomial and e*”, so we 


d d 
apply the technique again. Let uw, = 3x — 2 and =e". Tien = a 
x x 


and v; = se””. Thus 


| Ox = De dk = Lis Oe aS | (22° )G) dx 


= 16x — De" = | eas 
= VOe= Dye 208s ei. 


Substituting, we have 


[ox — Ax + Tje“"dx 
= eh Me ie = see ae ee 
= [x° — 2x + $- x +14+ PVYe"]+e 


= GX — get We +e. 
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Generally, faced with the product of a polynomial and a trigonometric 
or exponential function, it is best to let the polynomial be w and the transcen- 


d ; ey 
dental function be = . In this way, the degree of the polynomial is reduced 
x 


by one each time the product is integrated by parts. However, faced with a 
product of transcendental functions, the choice may not be quite so obvious. 


EXAMPLE 3. Integrate {e*cos 3x dx. Let us select ec?” as u and cos 3x as 


d du 
Then an 2e** and v = 4sin 3x. Thus 


| e’*cos 3x dx 


At this point a second integration by parts is necessary. The reader should 
check that the choice of sin 3x for u, will lead back to an identity. We choose 


d du 
e’* for u, and sin 3x for “! Then — = 2e and v, = —% cos 3x. Hence 
dx dx 


(e**)(4 sin 3x) — | (4 sin 3x)(2e”") dx 


gevsin 3x — gfe sin 3X Ox: : 


| esin 3x dx = (e°”)(—4 cos 3x) — | (208 3x)Ce “dx 
= —1ecos 3x + 3 | e*cos ax x. 
Substituting, we have 
; e’"cos 3x dx = 4esin 3x — a — Bete 3x + 2 | e**cos 3x ax) 
or 


2 or . 
| e008 3x dx - 1¢”*sin 3x + 2e’ ‘cos 3x — gfe "COS 38 ax. 


This does not look much simpler, since we have found an integral in terms of 
itself. But, if we add =)" cos 3x dx to each side of the equation (supplying 
the constant of integration at the same time), we have 


a 
gfe "Cos 3x ax = —C sin 3x + 2cos 3x) + ¢1. 
Finally, therefore, 


22 
| eos 3X 0x = = (3 sin 3x + 2cos3x) +c. 
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If we had chosen cos 3x as uw in the first integration by parts and sin 3x as u, 
in the second integration by parts, the integral could be found in the same 
way that we just found it. 


The differential of a function was introduced in Section 6 of Chapter 2 
and was shown to satisfy the equation dF(x) = F’(x) dx. Asa result, the 
symbol dx which occurs in an indefinite integral f f(x) dx may be legitimately 
regarded. as.a .differential.since, if: F’(x) = f(@),. then dF(x) = f(x) dx. 
Moreover, if u is a differentiable function of x, then 


dF(u) = F'(a) du = f(u) du, 
SO we write 


F(u)+ec= | 0 au 


[see (6.7) on page 216]. Using differentials, we obtain a very compact form 


d 
for the formula for integration by parts. Since du = ie dx and dv = = dx, 
es dx dx 
substitution in (1.1) yields 


CLI) 


We have less to write when we use this form of the formula, but the result 
is the same. 


EXAMPLE 4. Integrate ix inxax H- we we th?) we set w= nx and 


l 
du = xdx. Then ds = —dx andv = 3%. Hence 
x 


iE In x dx = (Inx)(4x’) — Jax) (: ax) 
os Eyl x= 1 | x av 
= 4x'Inx—di +e. 
In the remainder of this chapter we shall take full advantage of the 


streamlined notation offered by the differential and shall use it freely when 
making substitutions in indefinite integrals. 
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EXAMPLE 5. Find f x In(x + 1) dx. This example illustrates the fact that a 
judicious choice of a constant of integration can sometimes simplify the 


dx 


computation. Set u = In(x + 1) and dv = xdx. Then du = — | 
: x 


and we may take v = 2 . If we do this, we have 


a 


Je 
| x1nex + 1) dx = In(x + i= ie ie a 


l — 
ae Hae ee ene 
= 3x In(x + 1) ii fk 


x? 


l 
Dividing, we find — = =X = 1 eer 


, and hence 


ope _l - nee 

| snc + Ll) dx = $xInXx + 1) Ff (x + <4) a 
= 1y7In(x + 1) — 4X? — x + In[x + 1/) + 
= 4(x*— 1IIn(fx+1)—t°+4x4c. 


[Since In(x -+ 1) makes sense only if x + 1 > 0, we have replaced In|x + 1| 

by In(x + 1).] The same result is reached more quickly if we take 
x" x+k 

v= oy + c= 


. Then integration by parts gives 


x ok = | ae aa 
2 z x+ 1 
Since we have a free choice for k, we shall choose k = —1 and then 


2+k | ee = 
ee The problem becomes 


iE In(x + 1) dx = In(x + 1) dx. 


metas - [tte 
Nx 5) 9) xX 
4(x” — 1) In(x + 1) — $x° + dx +c. 


iE In(x + 1) dx 


The method of integration by parts can frequently be used to obtain a 
recursion formula for one integral in terms of a simpler one. As an example, 
we derive the useful identity 


(1.2) For every integern > 2, 


| cos’x dx = 


cos” ‘x sin x 
n 


+ nas | cos"—*x dx. 
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Proof. We write cos"x as the product cos"—'!x cos x, and let u = cos"—'!x and 
dv = cosxdx. Then v = sinx and du = (nm — 1)cos"~*x (—sin x dx) = 
—(n — 1) cos"—*x sin x dx. Hence 


cos ax = cos* gsm x°= [si x[—(n — 1) cos”~*x sin x dx] 


= cos”—!x sin x + (n — 1) | eos" *x sin’x ay. 

Replacing sin*x by 1 — cos?x, the equation becomes 
| cos"x dx = cos"—!x sinx + (n — 1) | cos" *x (1 — cos*x) dx 

= cos”—!x sin x + (n — 1) [eos dx — (n — 1) | cose ax. 
Adding (n — 1)fcos”x dx to both sides of the equation, we ae 

n | cos"x dx = cos”—!x sin x + (n — 1) | cos" “y-ax, 
whence (1.2) follows at once upon division by n. 

Thus by repeated applications of the recursion formula (1.2), the 


integral fcos"x dx can be reduced eventually to a polynomial in sin x and 
cos x. If is even, the final integral is 


[cos!x ax = fac-x+e 


[cos x ax = sinx+c. 


and, if n is odd, it is 


EXAMPLE 6. Use the recursion formula (1.2) to find {cos®2x dx. We first 
write {cos’2x dx = 4fcos®2x d(2x) and then 


4 : 
| cos 2x'dQxy= S08 FSi 4 : | cos’2x d(2x). 


A second application of the formula yields 


9 : 
| cos’ 2x d(2x) = cos exon + - | cos 2x d(2x), 


and, of course, 


[co 2X OZ2X) = Sin 2k FP A. 
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Combining, we have 


1 eats sin2x ..4 pa2 sin 2x 
Ss ) 


J cos'ax dt = 5) eae + - (sin 2x + ef 


4 : 72 : : 
= + cos'2x sin 2x + 3% cos 2x sin 2x + 7 sin2x + c. 


PROBLEMS 


1. Integrate each of the following. 


(a) f. cos x dx (f) [oe — 7x + 2)sin x dx 
(b) ; (x + 2)e"dx (g) | e**sin 2x dx 

(c) | arctan x dx (h) | sec x dx : 

(d) | x’sin 7x dx (i) | x'In(x + 1) dx 

(e) | x’ In x dx (j) | In(x” + 2) dx. 


2. Do Example 3 again, first letting u = cos 3x and then uw = sin 3x. 
3. Find formulas for 


(a) | e“*cos bx dx (b) | e”*sin bx dx. 


4. Derive the recursion formula analogous to (1.2): For every integer n > 2, 
sin” “xcosx n—1 
sin’x dx = — ee + a_i / sin” “x dx. 


5. Evaluate 


(a) | cos x dx (b) | sin’x dx 


by the recursion formulas [see (1.2) and Problem 4], and also using the trigono- 
metric identities 
cos’x = $(1 + cos 2x), 


sin?x = $(1 — cos 2x). 


Show that the results obtained are the same. (The preceding identities can be 
read off at once from two more basic ones: 


1 


Cos 26° = cos’x —. sin*X.) 


cos?x + sin’x 
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6. Use integration by parts to find recursion formulas, expressing the given integral 
in terms of an integral with a lower power: 


(a) Show that | x"e"dx = x"e* —an | x? ede: 


(b) Show that ’ sec x dx = —— =e a= | sec” x dx. 
(c) Find a reduction formula, expressing | (Injax + b|)"dx in terms of 
| colax + b|)"— dx. 
7. Use the formulas derived in (1.2) and in Problems 4 and 6 to find 


(a) | seas (d) ioe -+{- 7|)°dx 
w /2 
(b) | sin’x dx (e) | sin’x dx 
0 
1 
(c) | cos’ 5x dx (f) | xe'dx. 
0 


2. Integrals of Trigonometric Functions. Products of trigonometric functions, 
powers of trigonometric functions, and products of their powers are all 
functions which we need to integrate at various times. In this section tech- 
niques will be developed for finding antiderivatives of the commonly en- 
countered functions of these types. 

The first and simplest occur with the integrals 


[co ax cos bx dx, 
| sin ax sin bx dx, Cr 


| sin ax cos bx dx, in which a + b. 


None of these can be integrated directly, but each of the three integrands is 
a term in the expansions of cos(ax + bx) and cos(ax — bx) or in the ex- 
pansions of sin(ax + bx) and sin(ax — bx). We can use these addition 
formulas to change products to sums or differences, and the latter can be 
integrated easily. 


EXAMPLE |. Integrate: 


(a) | si 8x sin 3x dx, (b) | si IE 00S 2x ax. 
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The integrand sin 8x sin 3x in (a) is one term in the expansion of 
cos(8x + 3x) and also in the expansion of cos(8x — 3x). That is, we have 


cos(8x + 3x) = cos 8x cos 3x — sin 8x sin 3x, 
cos(8x — 3x) = cos 8x cos 3x + sin 8x sin 3x. 
Subtracting the first from the second, we get 
cos(8x — 3x) — cos(8x + 3x) = 2 sin 8x sin 3x. 
Hence, since 8x — 3x = 5x and 8x + 3x = 11x, we obtain 
sin 8x sin 3x = $(cos 5x — cos 11x), 


and so 


| si oy sin Sx gix = 1 | (cos 5x — cos 11x) dx 


— ees 
79 Sit 5x g5 sin 11x + c. 


For the integral in (b), we use the formulas for the sine of the sum and 
difference of two numbers: 


sin(7x + 2x) = sin 7x cos 2x + cos 7x sin 2x, 
sin(/x — 2x) = sin 7x cos 2x.— cos 7x-sin 2x. 
Adding, we have 
sin(7x + 2x) + sin(7x — 2x) = 2 sin 7x cos 2x. 
Hence 
sin 7x cos 2x = $(sin 9x + sin 5x), 


and 


| si 1300s 24 0x-= 2 (in 9x + sin 5x) dx 


eae 1 1 
= —7 Gos 9x — 7a C08 dx Hc. 


It should be clear that, using the formulas for the cosine and sine of the 
sum and difference of two numbers as in Example 1, we can readily evaluate 
any integral of the type given in equations (1). 


We next consider integrals of the type 


| cos” x sin" x dx, (2) 
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in which at least one of the exponents m and n is an odd positive integer (the 
other exponent need only be a real number). Suppose that m = 2k + 1, 
where k is a nonnegative integer. Then 


cos™x sin"x = cos?*t1x sin”x 
= (cos’x)* sin"x.cos x. 


Using the identity cos’x = 1 — sin’x, we obtain 


; a 
[cos i Sax = [a — sin’x)” sin”x cos x dx. 


The factor (1 — sin?x)* can be expanded by the Binomial Theorem, and the 
result is that {cos”x sin”x dx can be written as a sum of constant multiples 
of integrals of the form |sin?x cos x dx. Since 


fas 
| sintx 00s x ax = PA Be x+re ifg ~ —l, 
In|sin x| + c ie Le 
it follows that fcos”x sin”x dx can be readily evaluated. An entirely anal- 


ogous argument follows if the exponent n is an odd positive integer. 


EXAMPLE 2. Integrate 


(a) | cos'ax ax, (b) | sin’ cos'x dx. 
The integral in (a) illustrates that the method just described is applicable 


to odd positive integer powers of the sine or cosine (i.e., either ™ or n may be 
zero). We obtain 


| cos%as = | cos'ax cos 4x dx 


fo — sin’4x) cos 4x dx 


[co 4x dx — | sinvas cos 4x dx 
Pe ae oe 
= 4sin 4x — 75 sin'4x 4+ c. 


In (b) it is the exponent of the sine which is an odd positive integer. 
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Hence 


: ae y . 
| sin’x cos x Ux "= | (sin’x)” cos*x sin x dx 


DiS, : 
Jo — cos x) cos x sin x dx 


Jo = 2e@0g x 4. cos X)ices x.510,.x.0x 


gets : 
[co x sinxdx — 2 | costs sin x dx + | cos's sin x dx 


7 
—1cos*x + 2cos'x — $cosx +c. 


The third type of integral we consider consists of those of the form 


ae 


| con, x-Silt Rx, (33 


in which both m and n are even nonnegative integers. These functions are not 
so simple to integrate as those containing an odd power. We first consider 
the special case in which either m = 0 or n = 0. The simplest nontrivial 
examples are the two integrals fcos?x dx and Jsin?x dx, which can be 
integrated by means of the identities 


cos’x = $(1 + cos 2x), 
sin’x = $(1 — cos 2x). 


These are useful enough to be worth memorizing, but they can also be derived 
quickly by addition and subtraction from the two more primitive identities 


mae, 2 eee a) 
| = cos*x + sin*x, 
cos 2x = cos’x — sin’x. 


Evaluation of the two integrals is now a simple matter. We get 


[costs ax = 3} + cos 2x) dx = * 4 7sin 2x + é 
| sin’ a = sf — cos 2x) dx =5- sin 2x + é: 


Going on to the higher powers, consider the integral f cos’'x dx, where 
i is an arbitrary positive integer. We write 


cos’'x = (cos’x)’ = [4(1 + cos 2x)]’ 


tl + cos 2x)’. 


366 TECHNIQUES OF INTEGRATION [cuHaAp. 7 


The factor (1 + cos 2x)’ can be expanded by the Binomial Theorem. The 
result is that cos?*x can be written as a sum of constant multiples of functions 
of the form cos’2x, and in each of these j < 27. The terms in this sum for 
which j is odd are all of the type already shown to be integrable. The terms 
for which j is even are of the type now under consideration. However, the 
exponents j are all smaller than the original power 27. For each function 
cos’2x with j even and nonzero, we repeat the process just described. Again, 
the resulting even powers of the cosine will be reduced. By repetition of 
these expansions, the even powers of the cosine can eventually all be reduced 
to zero. It follows that, although the process may be a tedious one, the 
integral {cos®’x dx can always be evaluated. The argument for fsin?’x dx 
is entirely analogous. 


EXAMPLE 3. Integrate [sin°2x dx. We write 
sin°2x = (sin?2x)? = [4(1 — cos 4x)} 
= 4(1 — 3 cos 4x + 3 cos"4x — cos*4x) 
= g[1 — 3cos4x + 3(1 + cos 8x) — cos*4x] 
= 75 — $cos4x + 7% cos 8x — 4cos*4x. 


Hence 


= “ee 3 7 
J sintax ax = 5% = sin 4x + 5g Sin 8x — 7 cos 4x dx. 


In Example 2 we have shown that 


| cos'ax dx = 4sin 4x — 54 sin "Ax + ¢. 


We conclude that 


| sint2s dx —— sin 4x 7 5g sin f= : sin 4x +f gp g sin "Ax +e 


Ps. 32 
5x = 3 
= {6-20 pg Peis = 5 


Returning to the general case, we can now integrate [cos’x sin”x dx, 
where m and n are arbitrary nonnegative even integers. For, setting m = 2i 
and n = 2/7, we can write 


cos™x sin"x = cos”'x (sin?x)’ 


cos”*x (1 — cos?x). 
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When expanded, the right side is a sum of constant multiples of even powers 
of cos x, and we have shown that each of these can be integrated. This 
completes the argument. Actually, if neither mm nor n is zero, we can save time 
by using the identity 


ai yCOs X = % Sin 2x; 


as illustrated in the following example. 


EXAMPLE 4. Integrate [cos‘x sin?x dx. Since sin’x is the factor with the 
smaller exponent, we write 


cos‘x sin’?x = cos’x (cos’x sin*x) 
= cos’x (sin x cos x)’ 
= [4(1 + cos 2x)](4 sin 2x)’. 


Expanding, we get 
cos‘x sin’?x = 4(1 + cos 2x) sin’2x 
= ¢sin?2x + % sin’2x cos2x 


= 4.(1 — cos 4x) + | sin’2x cos 2x. 


Hence 
| costs sa xdx = is dx = is | c0 4x dx + 5 | sin’2x cos 2x dx 
So ee Boe 
= ag Sin 4x + Ze sin'2x + ¢. 


An important alternative method for integrating positive integer powers 
of the sine and cosine is by means of recursion (or reduction) formulas. In 
Section 1 [see (1.2), page 359], such a formula was developed, expressing 
fcos"x dx in terms of fcos”~*x dx. Following the derivation, Jcos®2x dx 
is evaluated with two applications of the formula. A similar reduction for- 
mula for J sin”x dx was given in Problem 4, page 361. Certainly no one should 
memorize these formulas, but, if they are available, they undoubtedly provide 
the most automatic way of performing the integration. 


We next turn to the problem of evaluating 


| tan’x dx, (4) 
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where n is an arbitrary positive integer. For n = 1, the integral is an ele- 


mentary one: 
—In|cos x| + ¢ 
sin x , 
[tan x ax = [8 ax = 
cos x 


In|sec x] + c. 


For n => 2, there is a reduction formula, which is easily derived as follows. 
Using the identity sec’x — 1 = tan’x, we have 


| tan”x dx = | tan” —*x tan’x dx 
os | tan” “x (sec’x — 1) dx 


= | tan”—*x sec’x dx — | tan”—*x dx. 


Since re tan x = sec’x, the first integral on the right is equal to 
i: 


| tan”—*x d tan x. 


Hence we obtain 


(2.1) | tan”x dx = a ta x | tan” *x dx. 


However, we generally perform such integrations without explicit use of the 
reduction formula (2.1). We simply carry out this technique of replacing 
tan’x by sec?x — 1 as often as necessary. 


EXAMPLE 5. Integrate ftan>x dx. Factoring and substituting, we get 


f tan’x dx = / tan’x tan’x dx 


| tan°x (sec’*x — 1) dx 


| tan°x sec’x dx — | tan’x dx 


+ tan*x — | tan x (sec’x — 1) dx 


z tan [an x sec’x dx a [ran x dx 


= +tan’x — $tan’x + In|sec x| +c. 
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The difficulty in evaluating the integral 
| sec” x dx, Cc 
where n is a positive integer, depends on whether uv is even or odd. Ifn = 21, 
for some positive integer /, then 


sec"x = (sec?x)’'sec?x = (1 + tan?x)’'sec?x. 


Hence, if 7 is even, sec”x can be expanded into a sum of multiples of integrals 
of the form 


| tan’x sec’x dx = Sores tan’t*x + ©. 
jt+1 


If n is odd, the problem is not so simple. We shall use the reduction formula 


n—2 
(22) | fsecrxae = Sot 


This formula is derived by integration by parts [see Problem 6(b), page 362] 
and is applicable for any integer n > 2, whether even or odd. With a finite 
number of applications, fsec”x dx can therefore be reduced to an expression 
in which the only remaining integral is fdx or fsec x dx, according as n is 
even or odd. Hence, if n is odd, we need to know {sec x dx. An ingenious 
method of integration is to consider the pair of functions sec x and tan x 
and to observe that the derivative of each one is equal to sec x times the other. 
Writing this fact in terms of differentials, we have 


g@sce x = sec x tan, x dx, 


dtan x = sec’?x dx = sec x sec x dx. 
Adding and factoring, we obtain 


d(sec x + tan x) = sec x(tan x + sec x) dx. 
Hence 


sec X 0x = aie sea te tan x) ) 
sec x + tan x 


from which follows the useful formula 


€2:3 ) | sec x dx = In|jsec x + tanx| +c. 
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EXAMPLE 6. Integrate fsec’x dx. Using the reduction formula (2.2) twice, 
we have 


31 
| sects Fata ee see 3 | sects dx 


BS 
sec x tan x 3 fsecxtanx 1 
- Sextene 5 3( 5 +} | scxas). 


From this and (2.3), we conclude that 


+ = In|sec x + tanx| +c. 


3 
sec’x tan 3 sec x tan 
| sete de = SES, 


Of course, the integration of f cot”x dx parallels the technique for in- 
tegrating f tan"x dx, and the integration of f csc"x dx parallels that for 
f sec”x dx. The reduction formula corresponding to (2.2) is 


The last type of integral to be discussed consists of those of the form 


| sec”’x tan” x dx, (6) 


where m and n are positive integers. There are a number of variations, depend- 
ing on whether each of m and n is even or odd. We shall consider three cases: 


Case I. mis even. Then m = 2k, for some positive integer k. Hence 


2 
/ sec’”’x tan"x dx = | sec’*x tan”x dx 


Al i EEE. fe 
| sec’*—*x tan”x sec’x dx 


2 c—L ie 
| (sec’x)*—* tan”x sec’x dx 


| (1 + tan’x)*—" tan”x sec?x dx. 


We can now expand (1 + tan’x)*~', and the result is that the original 
integral can be written as a sum of constant multiples of integrals of the 
form J tan’x sec?x dx. As we have seen, each of these is equal to fuidu, with 
u = tan x, and is easily integrated. 
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Case 2. n is odd. Then n = 2k + 1, for some nonnegative integer k. 
We write 


2 1 
| sec”x tan"x dx = | sec” tan’ +*x dx 
m—l So. AK 
= | sec. xian x) sec xian x.dx 
m—l 2 k 
~ | sc x (see x — 1)" sec x tan x dx. 


Again we expand by use of the Binomial Theorem. In this case, the original 
integral becomes a sum of constant multiples of integrals of the form 
f sec’x sec x tan x dx, each of which can be integrated, since 


a 


| sec’x sec x tan. x dx = | sec’x d(sec x) 


+i sec? thy + ¢. 


Case 3. nis even. Thenn = 2k, for some positive integer k. In this case, 
we have 


ok 
| sec”’x tan”x dx = | sec’”’x tan’ x dx 


= | sec”’x (tan’x)"dx 


4 sec”x (sec’x — 1)“dx. 


This time, if we expand the integrand, we get a sum of constant multiples of 
integrals of the type | sec’x dx, and we can use the reduction formula (2.2) on 
each of them. 

The three cases discussed are not mutually exclusive, and one may have 
a choice. For example, if m is even and n odd, the integral may be found by 
the techniques of Case | or that of Case 2. If m and n are both even, either 
the techniques described in Case 1 or Case 3 may be used. 


EXAMPLE 7. Evaluate the integrals 


(a) | sec’x tan’x dx, (b) | sec’x tan’x dx. 
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For (a) we write 


4 2 6 2 
[sec x tan’x dx = [sec x tan’x sec’ x dx 


i + tan’x) tan’x sec’ x dx 


2, 2 
-- | tan’x sec’x dx + | tan®x sec’x dx 
7 
= Ltan'x + dtan’x4 c. 


It is also possible to evaluate this integral by the technique described in Case 3. 
However, the resulting computation would be so much longer that it would 
be foolish to do so. 

For (b) we use the method of Case 2. Factoring, we get 


yi 


x 
| sec’x tan’x dx 


2 
| sec x tan’x sec x tan x dx 


me 2 2 
[sec x (sec’x — 1) sec x tan x dx 


2 4 2, 
| sec*s (ces — 2sec’x + 1)sec x tan x dx 
= | sects sec x tan x dx — 2 | sects sec x tan x dx 


2 
“Lb. | sects see x tan x ds 
45 7 2 5 1 3 
Seo a See Ao a eC x =e, 


We conclude with the remark that techniques for integrating 
fcse""x cot”x are analogous to those for [sec’’x tan”x dx. 


PROBLEMS 


1. Integrate each of the following. 


(a) | cos 5x sin 2x dx (e) | cos 3x sin 7x dx 
(b) [cos 3x cos x dx (f) cos 2ry sin wy dy 
(cj ; sin 3x sin x dx (g) | sin x sin 6x dx 


(d) | c0s Aveos 7202 (h) [00s Tw sin 17w dw. 


SEC. 2] INTEGRALS OF TRIGONOMETRIC FUNCTIONS 373 


2. (a) Integrate {sin*@ dé by using the fact that the exponent of sin @ is an odd 
positive integer. 


(b) Integrate /sin*@ dé by making use of the identity sin 30 = 3 sin 0 — 4 sin*6. 
(c) Show that the answers obtained in (a) and (b) differ by a constant. 
3. (a) Integrate {sin®2x dx by using the fact that the exponent of the sine is an 
odd positive integer. 


(b) Integrate [sin>2x dx using the recursion formula given in Problem 4, 
page 361. 


(c) Show that the answers obtained in (a) and (b) differ by a constant. 


4. Integrate each of the following. 


(a) | cos’ 2x dx (f) | / sin x cos’x dx 


(b) | cos*x dx (g) | cos’3x dx = 
(c) | sin’3x dx (h) | sin’y cos” ydy 

(d) | sin’x cos~-x dx (1) | cosy sin’y dy 

(e) | sin’6 cos 6 dé G) { sin’x (cos x)! "dx. 


5. (a) Integrate {cos?@ d@ using the identity cos’@ = $(1 + cos 26). 
(b) Integrate {cos?@ dé by parts. 
(c) Show that the answers obtained in (a) and (b) differ by a constant. 

6. Evaluate fsec?x tan x dx in two different ways: first using the fact that the 
secant has an even exponent, and then using the fact that the tangent has an odd 


exponent. Show that the two solutions differ by a constant. 
7. Integrate each of the following. 


(a) | tan’x dx (f) | sec’x tan’x dx 

(b) | tan’4y dy (g) | sec’x tan’x dx 

(c) | sec 6 dé (h) | sec x tan’x dx 

(d) | sec’ 2x dx (i) | sec’x\/tan x dx 
4 4 gx 

(e) [ x tan x dx G) | Re ee 
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8. (a) Let n > 2 be an integer, and derive a reduction formula for /cot”x dx 
analogous to (2.1). 


(b) Use the formula derived in (a) to integrate {cot®36 dé. 


9. By a method analogous to that used previously to find {sec x dx, prove that 


[esexas = —In|csc x + cot x| + ¢. 


10. (a) Use integration by parts to derive the reduction formula (2.4) for 
Jesce™x dx. 


(b) Use this formula to integrate {csc®y dy. 


11. Integrate each of the following. 


(a) | csc’6 dé (d) | csc'x cot’ x dx 
(b) ; sin 3x cot 3x dx (e) | csc’2y cot’2y dy 
(c) | cot’y dy (f) | csc’y cot’¢ de. 


3. Trigonometric Substitutions. In this section we shall study a technique of 
integration which is particularly useful for finding integrals of functions of 
v/a — x2, \/a2 + x2, and \/x? — a?. The technique is that of trigono- 
metric substitutions and is based on some of the elementary trigonometric 
identities developed in Chapter 6. We shall develop the method by doing 
specific examples. 


X-axis 


(0, |al) 


(V a*— x’, x) = ({a| cos @, |a| sin @) 


(0, —|a}) 


Figure 1 
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Consider the problem of evaluating { v/a? — x? dx. The domain of the 
function \/a? — x? is the closed interval [—|a|, |a|]. In what follows it will 
be convenient to view this interval as lying on the vertical axis, and, for this 
reason, the traditional positions of the x-axis and the y-axis will be inter- 
changed. For every real number x in the domain [—|al, |a|], the point 
(\/a? — x?, x) lies on the semicircle in the right half-plane with radius |a| 
and center at the origin (see Figure 1). It follows from the definition of the 
trigonometric functions sine and cosine on page 282 that this point 
(\/a2 — x?, x) is equal to ({a| cos 6, |a| sin @), where @ is the radian measure 
of the angle denoted by the same letter in Figure 1. Hence 


ees = |a| cos 0, 


=e Se 0, 


ee, 


We shall restrict 6 to the interval | - = 4 . Then @ is uniquely defined by 
equations (1), and, as @ takes on all values in this interval, x assumes all 
values in the domain of the function \/a? — x®. Using equations (1), we 


obtain dx = |a| cos 6 dé and 
[ve=2 a= fi cos 6: |a|cos @d@ = a | cos’ dé. 


Since cos?@ = 43(cos 26 + 1), 


2 
[ve= Pees * | (cos 20 + 1) dé 


2 
= (4 sin 20 + 6d) +c. 
ere 
But 6 = arcsin —— and 


la 
@— x2 a — x2 
+ sin 20 = sin cos @ = = eee ~ eS 
la} a a 


Substituting back, we get 


/ Ja — x2 dx 


9 
d+ 


‘ Decco 
a xV az — x? ; x 
5 ( — + arcsin zs) + c¢ 


S (xva? — 3? + a’arcsin a) + ¢. 


In general, with any integral involving \/a? — x2, we make the trigono- 
metric substitutions based on equations (1); i.e., we replace x by |a| sin 6, 
Va? — x? by |al cos 6, and dx by |a| cos 6d0. Note that there is an equiv- 
alent alternative procedure: We may set x = |a| cos @ and restrict 6 to the 
interval [0,7]. Then x can take on all values in the interval [—|a|, |a|] as 
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before, and, in addition, sin @ will be nonnegative. Since cos26 + sin’@ = lI, 
we will have 

Vat — x? = Va? — arcos?d = Va’sin2@ = |al sin 0. 


Thus the integral may be evaluated equally well using the equations 


| 


x = |a| cosy, 


(2) 


Vaz — x? = |al sin 8. 


Of course, if these substitutions are used, then dx = —|a| sin 6d6. Geo- 
metrically, equations (2) are obtained by starting from the point 
(x, \/a2 — x”), which lies on the semicircle in the upper half-plane instead of 
the right half-plane. 

A definite integral may be simpler to evaluate than an indefinite integral, 
since we may use the Change of Variable Theorem for Definite Integrals (see 
page 215) and thereby avoid the substitution back to the original variable. 


EXAMPLE |. Evaluate the definite integral 


2V3 ‘ 
| Se 
SA iG ae 
Using equations (1), we define 6 by setting x = 4sin 6 and \/a? — x? = 
4 cos 6. It follows that dx = 4cos@d6. If x = 2, then sin 6 = % and so 


a 3 
pe ee 2/3, then sin 0 = Vena = Lies Hence 
6 Zz 3 
2V3 9 Tt [3 fee 
| x"dx -{ 16 sin’ - 4cos 6 dé 
ee pee, EE 7/6 4 cos 6 
1/3 
= 16 | sin’6 dé. 
w/6 
Since sin’? = $(1 — cos 26), we obtain 
2V3 24 1/3 
dx 
a8) 1 — cos 26) dé 
I / 16 — x2 1 /6 ( ) 
T /3 
= (86 — 4sin 26) 
t /6 
= (s es Ae eee =o. oo 
3 3 6 6 
29 Bir = 4, ae Seat gat  hgBiig ae 
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Next we consider integrals involving \/a2 + x?. The domain of the 
function ./a? + x? is the set of all real numbers, i.e., the unbounded interval 
(—~«%, «#). Geometrically, we shall again find it convenient to place this 
domain on the vertical axis and to interchange the usual x-axis and y-axis 
in the picture. For every real number x, consider the point (|a|, x), and let 


(jal, x) 


Figure 2 


§ be the radian measure of the angle shown in Figure 2. It follows that 


*. =. altan 6, (3) 


= + x? = |al sec 0. 
. tees 
Big 
determined by x according to equations (3), and the interval (— «, «) of all 
possible values of x corresponds in a one-to-one fashion to the interval 


We restrict @ to the open interval (- ) . In this way, @ is uniquely 


(- : *) of all values of 6. It follows from the equation x = |a| tan 6 that 


dx = |a| sec?9 de. Hence, if we substitute |a| tan @ for x, we substitute 
la| sec 6 for \/a? + x? and |a| sec?@ dé for dx. 

Algebraically, the substitutions given by equations (3) arise from the 
trigonometric identity 1 + tan?@ = sec’9. If we set x = |a| tan 6, then x 
will assume all real values as @ takes on all values in the open interval 


T e e e e e 
(- 5? *) . Moreover, sec 4 is positive in this interval. It follows that 


Va + x2 = Var(l + tan2e) = Varsec?@ = |al sec 0. 


dx 
EXAMPLE 2. Integrate | = Letting x = |a| tan 6, we obtain 


Vets 
v/a? + x? = |al sec 6 and dx = |a| sec’6 dé. 


378 TECHNIQUES OF INTEGRATION  [cHApP. 7 
Hence 
2 
— = la] sec 6 dé = [sc 6 dé. 
Ja + x? |a| sec 0 
It was shown in Section 2 that 


[sec 6dé@ = In|sec 6 + tan 6| + c. 


Consequently, 
aS = In|sec 6 + tan é6| + c 
Vat + x? 
la| la ja| 
Using the properties of the logarithm, we may write 
In verte = In\Va2 + x? + x| — Infal. 


Since —In|a| + c is no more or less arbitrary as a constant than c itself, we 
conclude that 


| P= mivere tal +e 
a2 + x 


By trigonometric substitutions, functions of \/x? — a? may frequently 
be put in a form so that integration is possible. Since \/x? — a? is defined if 
and only if |x| > |a|, the domain of the function \/x? — a’, unlike the others, 
is the union of two intervals: (— «, —|a|] and [|a|, «). In this case we shall 
set x = |a| sec 6. Using the identity 1 + tan’@ = sec’@, we obtain 


If 6 is restricted to the interval [o, 2 , then tan @ is nonnegative, and, as @ 
takes on all values in this interval, x assumes all values in [|a|, 00). Similarly, 
if 6 is restricted to the interval |—r, — *) , then tan @ is again nonnegative, 
and, as @ runs through this interval, x correspondingly traverses (— #, —|a\] 
(in the opposite direction). Thus we have defined a new variable 6 by the 
equations 
x = ja sec @, 
| al (4) 
Vv 


x2 — a? = |al tan 6. 
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These equations can also be obtained geometrically. Figure 3 illustrates the 
situation for x > |a|. For every such x, consider the point (|a|, \/x? — a?) 
in the plane, and let 6 be the radian measure of the angle shown. Since x 
appears only as the hypotenuse of a right triangle, we have used letters other 
than x and y in labeling the horizontal and vertical axes. 


S-aXIs 


(la, V x*— a’) 


t-axis 


Figure 3 


It follows from x = |a| sec 6 that dx = |a| sec @ tan @d6. Hence if we 
make the trigonometric substitutions based on equations (4), we substitute 
la| sec 6 tan 6 dé for dx. 


EXAMPLE 3. Find the indefinite integral 


dx 
@ | — 


and evaluate the definite integral 


—3Ne 
(b) a4/ x2 = Fax. 
—6 
For part (a), we let x = |a| sec 6 and ./x? — a? = |a|tan6. Then 
dx = |a| sec 6 tan 6 dé and 


re 
| dx _ | lalsec@tan ode _ J = 


Jie. 2 + ae 


It was shown in Section 2 that 


[sc 6d = In\jsec 6 + tan6| + c. 


380 TECHNIQUES OF INTEGRATION [cHap. 7 


Hence 
peel = In|sec @ + tan 6| +c. 
s/ x2 — Q?2 
y2 — 2 
But sec 9 = 2 and tan @ = —— and, substituting back, we therefore 
a a 
obtain 
x2 — 
——* i a gas 
a/ Xt la| la| 
ef EO ees 2 
Since In ~ ++ al = In|x + a? — x?| — Inja| and since ¢ is an 
a a 


arbitrary constant, we may incorporate the term —In|a| into the constant of 
integration and conclude that 


| Pa- e+ v8 e+e 
x—a 


For the definite integral in part (b), we introduce 6 as the variable of 
integration by letting x = 3sec@. Then \/x?—9 = 3tan6 and 
dx = 3sec 6 tan @dé@. Restricting 6 to the interval |—r, _ | ;we-see that, 
when x = —6,sec 9 = —2andso6 = — = Similarly, when x = — 3/2, 
sec@é = —\/2 and@ = — : . With these substitutions and the Change of 


Variable Theorem for Definite Integrals, we obtain 


mas —3r/4 
| xou/ x2 — 9 dx | 27 sec’@- 3 tan 6-3 sec 6 tan 6 dé 


—6 —2r /3 


—37 /4 
243 | sec’@ tan’é dé. 


—27r/3 


To integrate this, we replace a factor of sec’@ in the integrand by | + tan’é. 
The integral then becomes 


—=§ V2 
| eras ee 
=$ 


—37 /4 
243 | (1 + tan’@) tan’@ sec’ dé 
—27 /3 


—37/4 


—37 /4 
243 | tan’@ sec’6 do + 243 | tan‘6 sec’ dé. 


—eris —27 /3 
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Since dtan 6 = sec’6 dé, it is easy to find antiderivatives of tan?@ sec?@ and 
tan*@ sec’@. Hence 


—3 V2 ie eee 
| x°/x2 — 9dx = 243(4 tan’ + 2 tan*6) 


a —27 /3 

243[(4-1+4-1)— 4:-3V3 4 4-9V3)] 
8 - 14/3 

aif 349°) 

81/8 — 42/3), 


| 


and the example is finished. 


Although the trigonometric substitutions developed in this section have 
been primarily directed at integrands containing certain square roots, we can 
equally well apply them to other functions of a? — x’, a? + x’, and x? — a’. 
Por example, if we let*x="(a) tan ?,* thera = x = a’(1 + tan’é#) = 
a’ sec’@ and dx = |a| sec?@d@. We then obtain 


de. = dilelstaniden, <1 ye a 
a? + x? — asec?9 «sa oe la| pe 


l x 
= cP arctan ae + c. 


dx 
a+ 


Given a choice, one would probably not evaluate | by this method. 


x? 
It is more likely that one would do the problem directly, remembering the 


fo mula [ ae arcta + 
r == arctan C. 
1+ x e 


We conclude this section with consideration of the integral 


| dx 

(ax? + bx + ce)" 
where 71 is any positive integer and the polynomial ax? + bx + cis irreducible 
over the real numbers. To say that a quadratic polynomial is irreducible 
means that it cannot be written as the product of two linear factors. It 
follows from the familiar quadratic formula that ax? + bx + c is irreducible 
over the real numbers if and only if b? — 4ac < 0. We shall show, by means 
of the trigonometric substitution used in the preceding paragraph, that the 
integral (5) can be changed to 


(5) 


K | cos-*6 dé, (6) 


where K is aconstant. This latter integral, as we saw in Section 2, can always 
be integrated, and this means that it is always possible to integrate (5). This 


382 TECHNIQUES OF INTEGRATION _ [cHAp. 7 


fact, which is of interest in itself, will play a part in a more general theory 
to be developed in Section 4. 
By first factoring and then completing the square, we obtain 


oe 


b? 


“fe | 


Note that, since b? — 4ac < 0, we know both that a #0 and that 


ax’ + tie ec 


See = b 
\/4ac — b? is real. For convenience, we shall let y = x + — and 
a 


| 2 
dae 
peo Mba Mica 


. ax? + bx +c =aQ’?4+ k’), 
and dy = dx. Making the trigonometric substitution y = |k| tan 0, we have 
ax?’ + bx +c = aly’? + k’) = ak*(tan’0 + 1) = ak’sec’#, 
and dx = dy = |k| sec’ d0. It therefore follows that 


x = | k|sec”0 dé - i | 
(ax? + bx +c)” J (aksec29)" = ak" J (sec @)2"—? 
K [cos*—*9 dé, 


| 
ak?” ; 
it into the integral of a power of a cosine by trigonometric substitutions and 
then by reducing the power of the cosine with the reduction formula on 
page 359. 


where K = 


Thus, every integral (5) can be integrated by first changing 


PROBLEMS 


1. Evaluate [\/a? — x? dx using equations (2), page 376, and compare your answer 
with that found using equations (1). 
2. (a) Write a set of equations for integrating functions of a? + x? which are 
analogous to equations (3), page 377, but are based on the identity 
1 + cot?é = csc’. 


(b) Select an interval to which @ can be restricted so that it is uniquely deter- 
mined by the equations in part (a) and so that x can take on all real number 
values. 


3. Evaluate [s Ja ; using the substitution described in Problem 2. 
a 
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. What is the set to which 6 should be restricted if the substitution of |a| csc 6 
for x makes x? — a? equal to |a|cot 6, defines @ unambiguously, and also lets 
x take on all real values such that |x| > jal? 

. Evaluate the following integrals. 


(a) | ese (f) | \/ (x2 + 4)3 dx 
(b) [ve ies (g) a 
see ae oY 


3 
(c) [vie +. 2-dxe (h) {a 


(d) | x/x2 — 4dx (i) | \/x2 — a2 dx 


dx | dx 
Ss i har aa 


. Evaluate 2 - a /a2 — x? dx, and hence find the area of a segment of height / 
in a circle of radius a. 
. Evaluate the following definite integrals. 


4 4 
dx dx 
| 


V5 4 
(b) i aS (e) | x/25 — x2 dx 
0 3 


V4 + x? 
d if? d 
xX xX 
. O/T Ax? 5/2 ~/A4x2 — 9 
. Integrate 
dx 
O ) 542 i) ane 
dx 
me fats ” —— re 
dx 
os / ef 2x $5 = pes G+ 9p 
dx 
(d) / dx? + 12x + 20 = | ate Eee 
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9. By the substitutions used to change equation (5) to (6) and by the reduc- 
tion formula on page 359, verify the following reduction formula (where 
b? — 4ac < 0): 


eee 
(ax? + bx +c) ( — 1Gac — B)@e + bx + ey 
Fee oo an | <a 
(n — 1)(4ac — b2) J (ax? + bx +c)" 


4. Partial Fractions. A rational function is by definition one which can be 
expressed as the ratio of two polynomials. A simple example is the function 
J, defined by 
f= aay = SS 
(x? + 1)(x— 2) x —2x2?+x-2 


for every real value of x except 2. At present, we have no way of integrating 
this function. However, in this section we shall develop a method of integra- 
tion which is applicable to any rational function. It is called the method of 
partial fractions. 

To illustrate the method, consider the equation 


1 a 5 


eo ee, ee 


which is easily seen to be true for all real values of x except 2. It follows that 


See eee 
(x? +1)x—2) 5x-—-2 5x24] 
wh bi 1 _* eae 
: ae eee oe ee Se 
Hence 
[pore ee 2 dx SS 
G?+1)(x-—2) 5J/ x-2 SJ x*4+1 SJ +1 
= 4In|x — 2| — Ay In@’ + 1) — Zarctanx +c. 
l 
Thus ————_————— can be integrated, since it can be written as a sum of 


(x? + I) — 2) 
simpler rational functions, each of which can be integrated separately. The 
method of integration by partial fractions is based on the fact that such a 
decomposition exists for every rational function. In this example, we have 
given no indication of how the decomposition is to be found. However, the 
general method, which we now describe, consists of just such a prescription. 
We begin with the result from algebra that it is always possible by means 


SEC. 4] PARTIAL FRACTIONS 385 


of division to express any given rational function as the sum of a polynomial 
and a rational function in which the degree of the numerator is less than the 
degree of the denominator. Stated formally, this theorem says that, if 
N(x) and D(x) are any two polynomials and if D(x) is not the zero function, 
then there exist uniquely determined polynomials Q(x) and R(x) such that 


(1) 


and such that the degree of R(x) is less than the degree of D(x). (The letters 
N, D, Q, and R have been used to suggest, respectively, the words 
‘numerator,’ ‘‘denominator,” ‘‘quotient,” and ‘“‘remainder.’’) The first 
step in the method of integration by partial fractions is to write the given 


N 
rational function s in the form of equation (1). Since we can obviously 
x 


integrate the polynomial Q(x), we need next consider only rational functions 
in which the degree of the numerator is less than the degree of the denomi- 
nator. If we start with such a function, then no division is necessary. 


— xt — 4x3 + 8x? — 7x + 3 
EXAMPLE |. Write the function Hise 22 ile WE oidicitionie as the sum of a 
x — 2x° + 3x — 4 
polynomial and a rational function in which the degree of the numerator is 
less than the degree of the denominator. Dividing, we have 


x — 2 
fe ee det nt — "Ax? ae Bee a3 
Re te Sy” Ay 


mus Dee Sort Ayes 
= Be oe Ay” = Hee oe 8 


x 3 AD. 
It follows that 


a x 


which gives the required sum. 


Another algebraic fact about polynomials, which we shall not prove, but 
shall assume and use, is that any nonconstant polynomial (1.e., of degree at 
least 1) with real coefficients can be written as a product of linear and quad- 
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ratic factors, each with real coefficients. By a linear factor we mean a poly- 
nomial L(x) of degree 1; that is, L(x) = ax+ banda+#0. Similarly, a 
quadratic factor is a polynomial Q(x) of degree 2; thus O(x) = cx? + dx + e 
and c ~ 0. The theorem states that, for any polynomial 


ID = GX ae SD 


with real coefficients a; and with n > 1 and a, + O, there exist linear 
factors L,(x),..., Lp)(x) and quadratic factors Q;(x),..., Q,(x) with real 
coefficients such that 


F(X) = Lil) ++ Lp (Qi) * + + Qa(). 


Note that either p or g may be zero. In actual practice, such a factorization 
of f(x) may be very difficult to find, but the theorem assures us that it exists. 
A polynomial is said to be irreducible if it cannot be written as the 
product of two polynomials each of degree greater than or equal to 1. The 
degree of the product of two polynomials is equal to the sum of the degrees 
of the factors, and it therefore follows that every linear polynomial is ir- 
reducible. It was pointed out in Section 3 that a quadratic polynomial 
cx? + dx + e is irreducible over the reals if and only if its discriminant 
d? — 4ce is negative. For example, the polynomials x? + land x2? + x-+ 1 
are irreducible, whereas x? + 2x + land x? + x — larenot. Ifa quadratic 
polynomial is not irreducible, it can be factored and written as the product of 
two linear polynomials. Hence the factorization of an arbitrary nonconstant 
polynomial into linear and quadratic factors, as described in the preceding 
paragraph, can always be done so that all the factors are irreducible. 
Returning specifically to the method of integration by partial fractions, 


N 
we consider a rational function te , with the degree of D(x) greater than 
x 
the degree of M(x). The second step is to write the denominator D(x) as a 


product of irreducible factors. Having done so, we have 


(2) 


where, for cach 7 oe] 5. .p, 
LAX) = ax + Bz a; ~ 0, 
ak, tor Gack = 
Q(x) = cj;x" + d;x + e;, d;" — 4cje; < 0. 


There is no reason to suppose that the factors which appear in equation (2) 
will all be distinct, and it may very well happen that L,(x) = L(x), etc. 
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However, the theory is simpler if no repetitions occur, and we shall consider 
that case first. 

Case 1. The irreducible factors of D(x) are all distinct. The algebraic 
theory of partial fractions, which we shall assume, tells us that we can write 
N(x) 
D(x) 
D(x) as its denominator and such that in each term the degree of the numer- 
ator is less than the degree of the denominator. Moreover, given the factor- 
ization of D(x) into irreducibles, this decomposition is unique except for the 
order in which the terms are written. Thus 


as a sum of rational functions each of which has one of the factors of 


OD ee a 
D(x) ax+t+bi ax t bp = 5 
Byx + Ci 


Ce + dx te, Cet + OX + e:" 


Pie nial sie fier 56 
CyX? 4 gk 7 Sg 


where each of the letters A;, B;, and C; represents a uniquely determined real 
constant. The rational functions which appear on the right side are called the 
N(x). 

D(x) 

We shall show by means of examples how the constants in the partial 
fractions decomposition are determined. Consider the rational function 
1 
x — 2) 
numerator, zero, is already less than 3, the degree of the denominator. More- 
over, the denominator is already factored into irreducibles. Hence, we seek 

constants A, B, and C such that 


partial fractions of the decomposition of 


discussed at the beginning of the section. The degree of the 


ase =e t 


Adding the two fractions on the right side, we have 


A(x® + 1) + (Bx + Cx = 2). 


Geo. -2° = =—9hlt + DC FD) 
The fact that a nonzero polynomial of degree n has at most n distinct roots 
implies that two rational functions with the same denominator are equal 
if and only if their numerators are equal (see Problem 8 at the end of this 
section). Hence the equation 


1 = AQ’ +1) + (Bx + OOH — 2) (3) 
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is true for all real values of x. There are two common ways to find the con- 
stants A, B, and C. One is to multiply and regroup the terms in (3) to obtain 
the equation 


G= (44 iy +e = es 7 — 5, 


which also holds for all real values of x. The only polynomial with infinitely 
many roots is the zero polynomial, i.e., the polynomial with no nonzero 
coefficients. Hence the three coefficients on the right side of the preceding 
equation are all equal to zero, and we can therefore find A, B, and C by solving 
the system of equations 


A aD. = 0, 
— 2 =, 
A ae eS 


Usually simpler is the technique in which we take advantage of the fact that 
(3) must be true for all values of x, and we choose values cleverly to help 
evaluate the constants. For example, letting x = 2 in (3), we have 


LS 40 2 3 ie 
= 54 


Hence A = 4. If we then let x = 0, we have 


1=210+1)+(B8-04+ 00-2 


= + — 2C, 
or, equivalently, 2C = 4 — 1, and so C = —. Finally, choosing x = 1, 
we get 
1=2(2+1D+ (8-1-0 - 2) 
‘ Se 5 So = 
from which we conclude that B = —2. Whichever method we use, we have 
A = 4, B = —4, and C = —2, from which it follows that 


es Lt Mespae 


the form which we integrated at the beginning of the section. 


EXAMPLE 2. Integrate | ; 5 3, ie oO = 2 = (a — Xa ee 
a". =—-X 


decompose —, ; into partial fractions and . Thus 
aa x CS 


l A B A(a — x) + Bla + x) 
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Equating numerators on the left and right, we get 
= A(a — x) + Bia x). 
Letting x = a, we obtain the equation | = A:0-+ B: 2a, andso B= — 
Similarly, setting x = —a, we get 1 = A+ 2a + B- 0, from which it follows 
that A ==. Thus 
2a 
ee ee 
P= se Jou a> 2a6 =\x 


and, therefore, 


Pe e2 dx : dx 
a—x? dajat+x a—x : 
= 5 Ina + x| — + Inja — x| +c 
= seem + ¢. 
ae 


Thus the third step in applying this method of integration is the de- 
composition into partial fractions, and the fourth and final step is the in- 
tegration of the partial fractions. We shall show later in the section that it is 
always possible to carry out the last step, but, as the next example shows, 
doing so can be tedious. 


a6 7— 14 46 
EXAMPLE 3. Integrate | ~ are a paar ae The degree of 


the numerator is 3 and that of the denominator is 4, so we proceed to the 
factorization of the denominator into irreducibles. It is already written as the 
product of two quadratics, of which x’ + x + 1 is irreducible but x’ — 
5x — 14 is not, since x? — 5x — 14 = (x — 7)(x + 2). Hence the form 
of the partial fractions decomposition is 


Po Sa ee See + = ss Sea 
Ge+x+)D)% —7(*x+2) #*+x4+1 —7 x+2 


The sum of the three fractions on the right side 1s 


(4x Bx — He + 2) 
+ Cox? + x+ Det 2)+ De’ +x+ D@-— 7). 


a a 


(x2 +x + 1)\(x — 7)(x 4+ 2) 
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Equating numerators, we have 


(Ax -- Dye = Fe 2h Ce Je ge ie oe 2p 
+ Dot? + x+ IX — 7) = 3x3 + x? — 14x + 46. 


If we set x = 7 in this equation, then 
CRIS = Wee or C= 2. 
Letting x = —2, we obtain 
Dit(=7) p= 34 or D= —2. 
If we let x — 0, then 


Bes Ce enw 


or 
—14B = 46 — 2C + 7D = 46 — 4 — 14 = 28 or f= =2, 
Finally, letting x = —1, we have 
(—A +B) (-8) + © oe = 
or 


8A = 58+ 8B— C+ 8D = 58 — 16 — 2 — 16 = 24 or , 
The partial fractions decomposition is, therefore, 


3x°+x°-— 14x+46 —  3x—2 aids oo] ae oy 
C2 +x+ 1G? —-—5x—14 xw+x4+1'x-7 x«4+2 
Except for the-first, the terms on the right side are easily integrated. The first 
term can be integrated by writing it as the sum of two fractions. We use the 


identity 


Bx +C By, 26% eg c . a l 
2c) cx2 ++ dx +e 


—) 


oe t+dxte 2ecetdxte 


Note that the numerator 2cx + d is the derivative of cx? + dx + e. Both 
of these fractions can be integrated. In this example, we have 


3X = 2 + %..2 I 7 ft 


ee et ee Se a 


Then 


ax +1 ia 2 
| et a= me +x+1)+4+c. 
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For the second fraction, we complete the square in the denominator. The 
result is 


= eee ere rancsaee 
eam DTC) 


arctan ~~ , it follows that 


la 


fe wot} 
ne ee ieee ee + Cc 
ey hee a = 


dy - ee 
yta fal 


and, since 


Hence " 


3x — 2 7 2x + 1 
a pees 5 In(x* +x+ 1) ~ Tearetan ( V3 )+e 


Returning to equation (4), we therefore get the final integral 


3x" + x" — 14x + 46 
(x2 + x + 1)? — 5x — 14) 
3 Q 7 2x + 1 
= -In(x+x+1 ~ 7 arctan ( =) 
aide V3 V3 
+ 2In|x — 7| — 2In|x + 2| + ¢, 


dx 


and this completes the example. 


We consider next the situation in which the factorization of the de- 


N(x 
nominator of x , as shown in equation (2), contains repeated factors. 


D(x) 


Case 2. The irreducible factors of D(x) are not all distinct. We assume, 
as in Case 1, that the degree of N(x) is less than the degree of D(x). There 


N(x) 
D(x) 


now it is more complicated. By regrouping, we may write the factorization 
of D(x) into irreducibles as 


is still a unique decomposition of into the sum of partial fractions, but 
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where m,...,™, and m,..., Ms, are positive integers, the factors L,(x) = 
a;x + 6; are all distinct, and the factors Q,(x) = c;x? + d;x + e; are all 


N 
distinct. In this case, te is the total sum of the following individual sums 
x 


of partial fractions: Forzeach i =+rl,"7.47, there 1s the sum 


Axi A jn Metin 


ses ae 


ax+b;' Gato ix + bm 


in which the A;; are uniquely determined real constants. Similarly, for each 
j= l,.+. see ss he sum 


— Byx 5 Caz Bj2x i Ce 


cjx? + djx +e; ' (jx? + djx + ej) 


in which the B;;, and C;, are uniquely determined real constants. 


2’ +x+2 
x(x — 1)° 
partial fractions. The degree of the numerator, 2, is less than that of the 
denominator, 4. So we turn at once to the decomposition into partial frac- 
tions. Since the irreducible factor x — 1 is repeated three times, the decom- 

position is of the form 


EXAMPLE 4. Integrate the rational function by the method of 


oe B c Dp: 
wg = 1S ae x 


Ce 
The sum on the right side is equal to 


Ae ey ae Se 
can cin Ge 


Equating numerators, we obtain the equation 
Ax(x — 1)? + Bx(x — 1)+ Cx + D(x — 18 = 2X°4+x+2, (6) 
which holds for all real values of x. Setting x = 1, we obtain 
C l=72-1* + 14a whence C = 5, 
if we let x = 0, then 
Dela) y= whence D = —2. 
Thus, equation (6) has become 


Ax(x — 1)? + Bx(x — 1) + 5x — 2(x — 1) = 2X° + x 4 2. 
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In this equation we let x = 2, getting 


A:-2+8-2+10—2=8+42+2 


or 
2A + 2B = 4. (7) 
Next, setting x = —1, we have 
A-(—1)-—2) + 8B: (—)D(—2) —5 + 6 =2-142 
or 


24 Op 8. (8) 


Subtracting (8) from (7), we get 6A = 12 and so A = 2. It follows that 
= 0, and we have therefore found the partial fractions decomposition to be 


23 | ae ea ier hs 
ws aa eS ee eS eae eee 
Hence 
(5 ae ae ie Oe as | dx dx $e 
fa aes 2 oe Gat 
5 | 
= 2 In|x es l| = 5) a — 9 age 2 In|x| + ¢, 


and this completes the example. 


Since any rational function can be written as the sum of a polynomial 
and a series of partial fractions, the general problem of integrating a rational 
function reduces to three integration problems: (1) integration of a poly- 


nomial; (2) integration of functions of the form where m is a 


A 
(ax + by”? 
positive integer and a # 0; and (3) integration of functions of the form 
Bx +C 
(cx? + dx + e)"” 
first, of course, offers no difficulties whatever. The second is also simple, 
A A a 
et Eee 


(ax + by” a (ax + b)”’ 


ao a 

Gee a tae 
a === ee 
a(l — m) (ax + 6b)" 


where n is a positive integer and d? — 4ce < 0. The 


and so 


+ ¢ WT rest, 


A inlax + b| +c icin =<. 
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The third problem can be solved, but we have seen in Example 3 that it is 
complicated even with n = 1. It is attacked by writing 


oo Bx +e B &«x+d +( _ dB —. 
2c} (cx? + dx + e)” 


2cx +d 
(cx” + dx + e)” 


d. 
| - with u = cx” + dx + e. The problem therefore reduces to finding 
u 


The integral | dx is easily found, since it is of the form 


dx 
(cx? + dx + e)"" 
that this integral can be evaluated by trigonometric substitution, which re- 
duces it to an integral of the form |cos?”~? @ dé. An alternative method is to 
use the reduction formula given in Problem 9, page 384. 

Thus all possible partial fractions resulting from the decomposition of a 
rational function can be integrated. It follows that every rational function 
can be integrated. The factoring of the denominator into irreducibles may 
be difficult, and the decomposition into partial fractions and the resulting 


integrations may be tedious, but the following important result has been 
established. 


However, it was demonstrated at the end of Section 3 


(4.1) THEOREM. Every rational function can be integrated by the method of 
partial fractions. 


A table of integrals will show how to integrate many of the functions 
which are the partial fractions of a rational fraction. For this reason, there 
is no need to memorize the formulas for integration. However, it is necessary 
to know the technique of separating a rational function into its partial frac- 
tions in order*to replace an apparently nonintegrable function by a sum of 
obviously integrable functions. 


PROBLEMS 


1. Separate each of the following into the sum of a polynomial and a sum of partial 
fractions. 


5 ay = 5x + 10 
} Gee © G&G? $2) 
x+2 a ae Se — Fe ee 
Gas. I)(x + 1) ) x? + 4x 
3 2 2 
QS (f) ee oe: 


Berk ay 4 ge hy 4 
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2. Integrate each of the following. 


5 2x° —- a = 2 
" beams af 0) if ga eee 
eave pata Pi 
(b) pastime. a, Cease PO a 


3. Find the partial fractions decomposition of each of the following rational func- 
tions. | 


x — 8 A 

OB exh © BEFD 

18 xf ~ xe 
EFT © ELDet® 
x+1 Ge 4227 ‘ 
(c) ie Ap (g) G@ 4 3) 

8x -+- 25 yo ae SS 
td) KA OX (h) (2x — 1)(x? + 1)? 
4. Evaluate each of the following integrals. 
x — 8 8x + 25 
(a) | pa Eee (d) bass 
18 ae tae © 

© | sper o- | Rees 

x +1 (x + 2)° 
o | ety " ea 

5. (a) Show directly that fone can be written in the form Sei 4 a... 
on. aN eB 2(x — 2)+ 1 
by first writing = op aa TERS 
ax +b 
(b) Following the method in (a), show that Patel s: kp can always be written 
X — 


A B 
ee 


, Where A and B are constants. 


(c) Extend the result in (b) by factoring, completing the square, and dividing to 
show directly that 
ax? ++ bx +c A B C 


b itt —————— + ——_— 
ae can be wri oe ee CS 


where A, B, and C are constants. 
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[Note: Without knowledge of the algebraic theory of partial fractions, it would 
not be unreasonable to assume that a decomposition of a rational function 
2 as would necessarily contain fractions z : pe I : 
P(x)(x — k)? x—k (x —k) 
Dx? + Ex+F 
ee 
composition B = D= £ = 0] 


and 


This problem shows, however, that in the complete de- 


6. Why can there not be an irreducible cubic polynomial with real coefficients? 
7. Integrate each of the following. 


(3x + 1) dx sec y tan y dy 
OO) eae s. ee 2sec?y + Ssecy + 2 
(x” + 1) dx ; ae 
©) | ead? es 
(x — 2) dx 10+ 5z2-—2z 
© | ets Oo | eae 
x ti He SY |= ee (6x + 3) dx 
(d) =a ©) | Get ae eae 
ge (x + 4) dx 
() ie { S824, 
(2x + 1) dx x? + 2x + 5 
(f) Pris Peek, mw) | BAS — 1)(x? + 1) 
(e) 2 sec’x dx (n) —— 
tan2x — 4tanx + 3 : (x? + x + 5) 


8. Prove the statement in the text, page 387, that, since a nonzero polynomial of 
degree n has at most n distinct roots, two rational functions with the same 
denominator are equal if and only if their numerators are equal. [Hint: Suppose 

Nix) _ No(x) 

D(x) D(x) 

Ny(x) — Nox) 

D(x) 
for every real number x for which D(x) ¥ 0.] 


that , where the polynomial D(x) is not the zero function. Then 


= 0, and so the polynomial equation Ni(x) — Ne(x) = 0 holds 


5. Other Substitutions. We have seen in Section 4 that any rational function 


N(x 

can be integrated by the method of partial fractions. This result can be 
x 

extended to show that any rational function of the six trigonometric functions 

can also be integrated. Such a function is defined as the result of replacing 
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ie ee 
each occurrence of x in sh by any one of the six possibilities: sin x, cos x, 
x 
tan x, cot x, sec x, or csc x. An example is the function F defined by 
y= sin'x cos x +.2.tan’x + sec x_ 
cos?x + 3cotx + 1 


which is obtained in the manner just described from the rational function 
ae es ee ee 
xX aes 4 


csc x is a Simple rational function sin x and cos x, 


. Since each one of the four functions tan x, cot x, sec x, and 


sin x 1 
fan x = ) sec xX = ——>? 
cos x COs x 
Cos x 1 
cotx = —-—> esc xX = = : 
sin x sin x 


it follows that every rational function of the six trigonometric functions is 
equal to a rational function of sin x and cos x. Thus, in the above example, 
we have 


- 2 
: sin x 1 
sin’x cos x + 2 ae 
f= cos2x cos x 
cos"x + 3 pees + | 
sin x 


e 3 3 ° 3 . 
_ sin'xcos x + 2 sin x + sin x cos x 
sin x cos¢x + 3 cos’x + sin x cos?x 


It is therefore sufficient to show that every rational function of sin x and cos x 
can be integrated. 

Surprisingly enough, a simple substitution will transform any rational 
function of sin x and cos x into a rational function of a single variable. The 
substitution consists of defining y, a new variable of integration, by the equa- 
tion 


Cr) 


We can express cos x in terms of y by first writing 


pee = oe e008 sin” — 
zZ 2 Z 
3 ges, 2 
Since cos” 5 + sin’ 5 = 1, we have 
2x 9X ox rae a, 
cos = — sin’= cos ~— sin’s 
2 2 zZ 2 
CS Se 


9X Pee 
cos: = sin = 
2 a 2 
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aus x 
Dividing numerator and denominator by cos? 5? we get 


9X 
eae 
1 + tan’ 5 i 


Thus we have obtained the equation 


(2) 
In a similar fashion, 
' x 2 x 
sin x = sin 2-5 = 2 sin 5 COS 5 
aye 
sin 5) re Pe 
ws cos’ 5 — tan 5 [a(1 + cos x)] 
cos = 
- 
x 
= tan 5 (1 + cos x) 
2 
ee 
Hence 
Cae 
Finally, since : = arctan y, or, equivalently, x = 2 arctan y, we have 
tes 


By means of the substitutions given in formulas (2), (3), and (4), any integral 
of a rational function of sin x and cos x can be transformed into an integral 
of a rational function of y. Since the latter can be integrated by partial 
fractions, we have proved that every rational function of sin x and cos x can be 
integrated. 
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cos x dx 
EXAMPLE 1. Integrate | ae tt we letopi= tan ~ then, as we have 
b+ cos X ‘i 
i ny 2 dy 
seen, we may replace cos x by 5» and dx by ; ;: The integral then 
becomes 5s y 
1—y 2dy 
cos x dx_ =f ale 
1+ cosx 1 — y? 
es 
May) dy 


eee es se ye — ¥*) 
_ {2a—-y’% «jis 
[Mae r+ eo 
By division one finds that 


EE 2 
ee ta. 


Hence 


cos x dx 
1+ cosx IG ce 5) 
= —y-+2arctany+c. 


Zz 
But y = tan 5 and x = 2 arctan y, and we therefore conclude that 


cos x dx x 
T + cos x == —tan 5 + x + Cs 


We do not recommend that the above substitution formulas be memo- 
rized. However, one should remember the simple fact that any rational 
function of the six trigonometric functions is equal to a rational function of 
the sine and cosine, and one should also remember that a routine substitution 
procedure exists by which the integral of a function of the latter type can be 
reduced to the integral of a rational function. For the details, one will prob- 
ably want to refer directly to formulas (1), (2), (3), and (4). 

There are other substitutions which simplify integrals, but none of them 
is as standard and automatic as the one just described. For example, 

\/x dx 

1+ Vx 
integration y by the equation y = \/x, the substitution yields a simple in- 
tegral. 


is not readily integrated. However, if we define a new variable of 


400 TECHNIQUES OF INTEGRATION [cHap. 7 


/x dx = 
Teer Let ' page 


Then y? = x and 2y dy = dx. Substituting for \/x and dx, we obtain 
. | ; 
Vx dx =) ee | 2y = 


EXAMPLE 2. Evaluate the indefinite integral 


L4/xsl to I+y 
Division yields the identity 
NO ee 
Sig eases t= 
Hence 
Vx dx I 2 ) 
== 2y —-2+ ——])d 
1+ Vx ee 


=y—2y+2Inll +y| +c. 
Since \/x is nonnegative, we have |1 + y| = |1 + V/x| = 1+ +/x. Thus 
{ /x dx 
1+ vx 
The same integral can be evaluated by a different substitution. Let us 
define the variable z by the equation z = 1 + \/x. Then./x = z — 1 and 


x = (z — 1) and, as a result, dx = 2(z — 1) dz. After substitution, the 
integral becomes 


=x—Wx+2nlli+vx+e. 


Vaedx < | @—De@= he 
1+ vx Z 
: = peak 1) dz 
Z 


| (2-442) ae 


z’ — 4z + 2Inlz| + c. 


Again, since .\/x is nonnegative, we have |z| = |1 + V/x| = 1+ Vx. 
Hence, after substituting back, we get 
Vx dx 


= ee wie? ed Et ee) + 2 In 
1+ +x 


Pa O/s x S d — ae er eee 
R= 2V/x 42a + vo He ee 
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The two solutions in Example 2 differ by a constant, in accordance with 
Theorem (5.4), page 114. The two substitutions differ in the initial goal: 
In the first, we decided that the integral would be simpler if we replaced the 
radical by a new variable, and in the second we decided to replace the de- 
nominator. There is little to choose between the two methods. 


2 — 3 
EXAMPLE 3. Integrate | at dx. If we define the variable y by the 
equation y = (2x + 5)*, then y? = 2x + 5 and 3y’dy = 2dx. Hence 
y>— 5 3y'd 
x= z 5 and dx = = iss Substituting, we get 


fs emilee ay"dy 


(Qx 5a ~ y 2 . 


ee ee ee 
a ee eae 201) 


-: alo’ — 10y* + 13y) dy 
= 3g)" — ty’ + ey’) + 
= &(2x + 5)** — 32x + 5)? + 8802x4 5% +. 


There are no universal rules for integration by substitution. In most 
cases we are interested in replacing an involved function forming part of the 
integrand by a simpler one, frequently by a single new variable. 

In this chapter we have developed a number of techniques for finding 
indefinite integrals, or antiderivatives. However, it is by no means the case 
that these techniques will yield an antiderivative for every integrable function. 
For example, it is impossible to integrate f e**dx in the sense that the word 
“integrate’’ has been used in this chapter. (Since e~*” is everywhere contin- 
uous, an antiderivative certainly exists. In particular, the function F defined 
by 


F(t) = | e-* dx, for every real number 1, 
0 


_is an antiderivative as a result of the Fundamental Theorem of Calculus, 
page 200. However, it can be proved that no antiderivative of e~** can be 
expressed algebraically in terms of functions defined by compositions of 
rational functions, trigonometric functions, and exponential and logarithmic 
functions.) Nevertheless, the methods discussed in this and the preceding 
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section have significantly increased the set of functions whose indefinite 
integrals we can find. 

7 The reader should be aware of the fact that there are in existence excellent 
tables of integrals in which frequently encountered integrals are tabulated. 
No such table contains all tractable integrals, but some are quite complete, 
and they are of immense practical value for those people whose work often 
leads them to problems requiring integration. 


1. (a) 


(b) 


(b) 


PROBLEMS 


Integrate [sec x dx by the technique for integrating rational functions of 
trigonometric functions. 


We have already shown (see page 369) that 
[ sex = In|sec x + tan x| + ¢. 


Show that this solution agrees with the one found in (a) for an appropriate 
choice of the constant c. 


Integrate fcsc x dx by the technique for integrating rational functions of 
trigonometric functions. 


The formula {esc x dx = —In|csc x + cot x| + c is given on page 374. 
Show that this integral agrees with the one obtained in (a) for an appro- 
priate choice of c. 


3. Integrate each of the following. 


(a) 


(b) 


(c) 


(d) 


(e) 


(f) 


| x dx (g) | dy 
we te Vy + Vy 
| x = (h) e tan x tl 
ee ae 1 + tan?x 
sin x dx x 
aa SY Be: (i) » ek cl dx 
| 1 + sinx \/5x rs 
i () * 
sin x + cos x J i «/x)° 
dt e dx 
5G. Bee (k) posite, 
2+ cost V/1 4 ie 
ine | dx 
dx (1) eee 
Ga- + + x)/3 /1 1 e 
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4. Evaluate each of the following definite integrals. 


dx : 
(a) | See (c) | sin’x cos’x dx 
x/g COS?x sin x 0 


(b) f=. (d) | 
0//14+x 1 Wx + 2/x 


SUPPLEMENTARY PROBLEMS 


Integrate each of the following. 


1. | x'cos 2x dx ee / V/4x2 + 1 dx 
dy fs z —— 2 
Z: | 16+) 14. | s(e e )dz 
dy 15. ae 
_ 16 — y? VJ/x + wx 
: xdx 
' | 1 — x - a 


: 


5: | tee ay 17. 
6. | a. 18. 
yVvx-t+i 


¥ [etx tcosxrar loans 


e’'cos t dt 


dx 
1+ e 


20. 


8. [oe + 1) In|t| dt wS 4 x2 


9. — ; 21. CERES 


J. x dx 
V4+ x2 dx ae 


ee + e2r 


i: 


—22z 
11. fs 53, ee 


sin’2x cos’’2x dx 24. | dn y)dy 
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25. | sin’x dx 38. [teas 


26. Jo + 1)Inxdx 39. [va — x2dx 
sec ¢ tan t e’dx 
ZT. Po er ee vat AO. G+ 2e)4 dey 
28. | Pe a, | Via 
ae 2 e449 
x‘dx 
29. | vin x as nx#—1 oo 
In x 2x — a 
30. [Meas 43. am zs 34 
31. | sey dy 44. [. “°"sec x tan x dx 
a. [ince — Die 45, [ae 5° dx 
33 [cos 4y cos 5yd 46. fa c= pos eee 
“ nai 1 + 4sinx 
34. [00s 3x sin 5x dx 47. iE arccos x dx 
dx (St — 11) dt 
oe a 12 — 3t — 10 
dx 3x ax 
eo a 
x2+ 4% + 5 i = ae 
i. x ae 1 
31. NZ Ox 50. ae. 
x —1 


evarter 6 


The Definite Integral 
(Continued) 


1. Average Value of a Function. Let f be a real-valued function of a real 
variable which is bounded on the closed interval [a, b]. Furthermore, let f 
be integrable over [a, b]. Then the mean, or average value, of f on the interval 
[a, 5] will be denoted by M2(f) and is defined by 


b ! 

I | | 

— | f, if a-<_ Dp, 
Mig = 29 = 6 0 
f(a), if a =.b. 
If a < b, then it follows at once from the definition that 
b 
(6 — a) Mi(f) = | f 


This equation is also true if a = 6, for then.both sides are equal to zero. We 
conclude that 


b 
(1.1) | f= (6 — aMi()). 


If f is nonnegative on [a, b], i.e., if f(x) > 0 for every x such that 
a<x <5, then (1.1) yields a good geometric interpretation of the mean 
M?(f). Let P be the set of all points (x, y) such that a < x < b and 
0 < y < f(x), as shown in Figure 1. Then 


b 
area(P) = | f = (6 — a)MiCf). 


It follows that M?(f) is equal to the height of a rectangle with the same base 
and the same area as P. 
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Figure 1 


EXAMPLE 1. Let f be the function defined by f(x) = x° — x + 1. Find the 
average value of f on the interval [0, 2], draw the graph of f, and show on it 
the rectangle with base [0, 2] and area equal to the area under the curve. The 
graph is shown in Figure 2. The mean, or average value, of fis given by 


I 


sig | renee 


2 
| (x* — x+ 1) dx 
0 
t fee 
-3(§-3 +.) 


y=x*-x+1 


Mi(f) 


| 


SS 


— 


Figure 2 
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The words “‘mean” and “average value’? are common in our vocabu- 
laries and have intuitive meaning for most of us. To use them as names for 
M2(f) is a sensible thing to do only if this quantity, as we have defined it, 
has the properties we associate with these words. We shall now show that it 
does. 

First, let us verify that the average value of a velocity function agrees 
with our earlier definition of average velocity. We consider a particle moving 
along a straight line, which we take to be a coordinate axis. The position and 
instantaneous velocity of the particle at time ¢ are denoted by s(¢) and v(?), 
respectively, and we know that s’(t) = v(t). Suppose that the interval of 
motion is from time t = a to time t = b and that a < b. Assuming that v 
is a continuous function, we have 


b 
| u(t) dt = s(b) — s(a). 


According to the definition on page 104, the average velocity vg, is equal to 


Ss 


GL ae 


The mean, or average value, of the function v on the interval [a, b] is given by 


b 
M.v) = ae | v(t) dt 


_ = @) 
eee a 


Hence the two definitions agree. 

The basic properties of the average value of a function correspond closely 
to the basic properties of the definite integral as they are enumerated at the 
beginning of Section 4 of Chapter 4. To begin with, we would expect a func- 
tion which is constant on an interval to have, on that interval, an average 
value equal to the constant value of the function. The following proposition 
states that this is so. 


(1.2) If f(x) = k for every x in the interval [a, b], then M3(f) = k. 


The proof is an immediate corollary of the definition of the mean M?(/f) 
and of Theorem (4.1), page 191. The reader should supply the details. 

If one function is less than or equal to another function on some interval, 
then the lesser one should have the smaller average value. Thus we expect the 
theorem: 


(1.3) If f and g are integrable over [a, b] and if f(x) < g(x) for every x in 
[a, b], then Ma(f) < Ma(g). 
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The proof follows easily from Theorem (4.3), page 191. 

We introduce the third property of the average value of a function by 
means of an example. Suppose that on a 5-hour automobile trip the average 
velocity is 45 miles per hour during the first 3 hours and 30 during the last 2 
hours. What is the average velocity for the whole trip? To get the answer, we 
observe that the total distance traveled is 


AS.>:3 4-30: 2 = 125 mutes. 
The average velocity over 5 hours is, therefore, 
435 = 39 mph. 


If we denote the instantaneous velocity of the automobile by v(t), and assume 
that the trip began at time t = 0, then we can express the fact that the average 
velocity over the first 3 hours was 45 miles per hour by the equation Mj(v) = 
45. Similarly, we are given M3(v) = 30 and have shown that Mj(v) = 39. 
Since 3:45 + 2-30 = 5: 39, we can write 


(3 — 0)Mi(v) + 6 — 3)M3v) = 6 — 0)MG(). 


Abstracting from this example, we conclude that the average value of a 
function should have the property expressed in the proposition: 


(1.4) If f is integrable over [a, b] and [b, c], then 
(b — a)Ma(f) + (c — b)Ms(f) = (c — a)MA(f). 


Proof. Since (b — a)M®(f) = Jf f, the conclusion of (1.4) is equivalent to the 
equation 


a» b ec c 
} 9p ee 


But this is one of the basic properties of the definite integral [see Theorem (4.2), 
page 191], so the proof is complete. 


The next theorem states the properties of the mean corresponding to 
Theorems (4.4) and (4.5), page 191. 


(1.5) If f and g are integrable over [a, b] and if k is any real number, then 


(i) MES) = kMi(f), 
(ii) MoCf+ g) = Ma(f) + Mi(g). 


The proofs are left as exercises. 
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EXAMPLE 2. Let us see whether the definition of average value of a function 
agrees with our intuition in a simple example. Let f be the linear function 
defined by 

f(x) = 9x + I, 


whose graph is shown in Figure 3. What is the average value of f between 
2 and 6? 


y-axis 


f(x) =yx+1 


Figure 3 


We have f(2) = 4-2+ 1 = 2 and f(6) = $:6+1= 4. Since the 
graph of fis a straight line, the region below the curve is a trapezoid. It would 
seem natural for the average value of f on the interval to be the length of the 
median, which is given by 


fQ+f/O) _2+4_, 
2 2 


Computation of M3(f) yields 
6 
min = <1; | axt Das 
s=—), 
1 x 6 
rid (5 us x) 


=HGE+ 6) - @+2)] 
=e ApS = Se 9, 


In motivating the definition of the mean, or average value, of a function, 
we have seen its very close connection with the definite integral. Since a 
beginning student of calculus probably has a greater feeling for the idea of 
average than for that of an integral, it is fruitful to reverse our point of view. 
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That is, if we were to ask the question “‘What really is the definite integral of a 
function?’’, one answer is that it is a weighted average. Specifically, as stated 
in (1.1), the integral J f is equal to the product of 6 — a and the average 
value of f on the interval [a, 5]. 

We conclude this section with a theorem which is sometimes called the 
integral form of the Mean Value Theorem. It asserts that if f is continuous, 
the number M2(f), which we have called an average value, is quite literally 
the value of the function f for some number between a and 5b. 


(1.6) INTEGRAL FORM OF MEAN VALUE THEOREM. /f a < b and if f is 
continuous on the interval [a, b], then there exists a number c such that 
a@<¢ < bend MA) = flo), 


Proof. Since f is continuous at every point of [a, b], it follows by the Fundamental 
Theorem of Calculus that the function F, defined by 


Fix} = f(t) dt, for every x in [a, 5], 
is differentiable. Furthermore, 


F' (xy = Fee for every x in [a, 5]. 


A differentiable function is necessarily continuous [see (6.1), page 55], and 
so F more than satisfies the hypotheses of the Mean Value Theorem, (5.2), 
page 113. That theorem therefore implies that there exists a number c such 
that-a < c < dDand 


F(b) — F(a) = (6 — a)F'(c). 


But Fe) = fosand 


b 
F(b) — F(a) = | f(x) dx. 


Hence 


b 
| aja = 41 — Bre), 


and so 
. 
jicl = alae F f(x) dx = M.(f). 


This completes the proof. 
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PROBLEMS 
1. Find the average value M2(f) of f on the interval [a, b], where 
(a) f(x) = x? — 2x + 1 and [a, b] = [0, 2]. 
(b) f(x) = 2x* and [a, 5] = [—1, 1]. 


sand [c, 5) = [1, 21. 
XK 


fax) = ss and [a, b] = [1, n], where n is a positive integer. 
x 


(e) f(x) = sin x and [a, b] = [0, z]. 
(f) f(x) = Inx and [a, db] = [1, 5]. 


2. In each of the following find M2(f), draw the graph of f, and superimpose 
on the graph a rectangle with base [a, b] and area equal to the area under the 
curve y = f(x) between a and b. 


f(x) = x, a =. —1, ands =. 4 
ety) = x*, a = 0, and b= Tt. 

(c) f(x) = 4- «&« — 1)’, a = 0, and b = 3. 
maf) = e*, a = 0, andb:= 2. 


me fix) = cos.x, a. = 0, and). — = 


3. Each of the propositions (1.2), (1.3), (1.4), and (1.5) corresponds to one of the 
basic properties of the definite integral as they are enumerated in Theorems (4.1) 
through (4.5), page 191. In general, the proof of each is obtained by checking the 
special case a = b separately and then using the formula 


b 
M(f) = io | f(x)dx, fora <4, 
a 8 


together with the appropriate property of the integral. 
(a) Prove (1.2) (b) Prove (1.3) (c) Prove (1.5). 


4. A stone dropped from a cliff 400 feet high falls to the bottom with a constant 
acceleration equal to 32 feet per second per second. That is, 


m= v0 tt) =-s"G) = 32, 


where the direction of increasing s is downward. If the stone is dropped at time 
t = 0, find the time it takes to reach the bottom of the cliff, and the mean velocity 
during the fall. 

5. A typist’s speed over an interval from t = 0 to t = 4hours increases as she 
warms up and then decreases as she gets tired. Measured in words per minute, 
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suppose that her speed is given by v(t) = 6[4? — (t — 1)?]. Find her speed at 
the beginning, at the end, her maximum speed, and her average speed over the 
4-hour interval. How many words did she type during the 4 hours? 

6. A particle moves during the interval of time from t = 1 second to t = 3 seconds 
with a velocity given by v(t) = #? + 2t + 1 feet per second. Find the total 
distance that the particle has moved and also the average velocity. 

7. For each of the functions and intervals in Problem 2, find a number c such that 
a<ec< band Mf) =7@.: 

8. An arbitrary linear function f is defined by f(x) = Ax + B for some constants 
A and B. Show that 

mip = LO+4 £0), 

9, Let x(¢) be the number of bacteria in a culture at time ¢, and let x» = x(0). The 
number grows at a rate proportional to the number present, and doubles in a 
time interval of length 7. Find an expression for x(t) in terms of xo and 7, and 
find the average number of bacteria present over the time interval [0, 7]. 


2. Riemann Sums and the Trapezoid Rule. This section is divided into two 
parts. The first is devoted to an alternative approach to the definite integral, 
which is useful for many purposes. The second is an application of the first 
part to the problem of computing definite integrals by numerical approxima- 
tions. We begin by reviewing briefly the definitions in Section 1 of Chapter 4. 
If the function fis bounded on the closed interval [a, b], then, for every parti- 
tion o of [a, b], there are defined an upper sum U, and a lower sum L,, which 
approximate the definite integral from above and below, respectively. The 
function f is defined to be integrable over [a, b] if there exists one and only 
one number, denoted by f J, with the property that 


b 
a u< | fst, 


for every pair of partitions o and 7 of [a, 5]. 

In the alternative description of the integral, the details are similar, but 
not the same. As above, leto = {xo,..., Xn} bea partition of [a, b], which 
satisfies the inequalities 


aS hy LX ee 


In each subinterval [x;_1, x;] we select an arbitrary number, which we shall 
denote by x*. Then the sum 


=P es) 


t=1 
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is called a Riemann sum for f relative to the partition 0. (The name com- 
memorates the great mathematician Bernhard Riemann, 1826-1866.) It is 
important to realize that since x* may be any number which satisfies x;_1 < 
x* < x;, there are in general infinitely many Riemann sums R, for a given f 
and partition o. However, every R, lies between the corresponding upper 
and lower sums U, and L,. For, if the least upper bound of the values of f 
on [x;_1, x;] is denoted by M; and the greatest lower bound by m,, then 
f(x*) is an intermediate value and 


m; < f(xi) < Mi, 


as shown in Figure 4. It follows that 


n 


DS miei — xa) $Y SOD Ges = 11) SY Maes — 1), 


i=1 


which states that : 
| a 


for every Riemann sum R, for / relative to o. 


v = f(x) 


Xi xr. 3B X-axis 


Figure 4 


Every Riemann sum is an approximation to the definite integral. By 
taking partitions which subdivide the interval of integration into smaller and 
smaller subintervals, we should expect to get better and better approxima- 
tions to |, f. One number which measures the fineness of a given partition a 
is the length of the largest subinterval into which o subdivides [a, 6]. This 
number is called the mesh of the partition and is denoted by ||a||. Thus if 
@ = 1%, ..., Xn} 18 a partition-of [a, b] with 


Ge Ry BO, SS hy =O, 
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then 
|o|] = maximum {(x; — x;_1)}. 
i: tag ot 


The following definition states precisely what we mean when we say that the 
Riemann sums approach a limit as the mesh tends to zero. Let the function f 
be bounded on the interval [a, 5]. We shall write 


Hit ay = J. 

Io || 0 
where L is a real number, if the difference between the number L and any 
Riemann sum R, for fis arbitrarily small provided the mesh ||c|| is sufficiently 
small. Stated formally, the limit exists if: For any positive real number e, 
there exists a positive real number 6 such that, if R, is any Riemann sum for 
f relative to a partition o of [a, b] and if ||o|| < 6, then |R, — L| < «. 

The fundamental fact that integrability is equivalent to the existence of 

the limit of Riemann sums is expressed in the following theorem. 


(2.1) THEOREM. Let f be bounded on [a, b]. Then f is integrable over [a, 5] 
if and only if lim R, exists. If this limit exists, then it is equal to = :S 


[ello 


The proof is given in Appendix C. 


EXAMPLE 1. Using the fact that the definite integral is the limit of Riemann 


sums, evaluate 
Wi a2 ee ae 


n3/2 


lim 


no 


The numerator of this fraction suggests trying the function f defined by 
f(x) = \/x. If o = {xo,..., Xn} is the partition of the interval [0, 1] into 


l 
subintervals of length — , then 
n 
Xo = 0, Xi, = 
) i 2 = See Ss 


7 ae 


In each subinterval [x;_,, x;] we take x* = x;. Then 


fod = feo < 2 vi, 


SEC. 2] RIEMANN SUMS AND THE TRAPEZOID RULE 415 


and the resulting Riemann sum, denoted by R,, is given by 


R, = f(xi (xi — Xi) = ar 
2 1 V/n® 
l< = 
= man Du V! 
Vises 
n3/2 


The function »/x is continuous and hence integrable over the interval 
[0, 1]. Since ||o|| > 0 asm — o, it follows from Theorem (2.1) that 


b 1 
[ t= | vea- ee 
a 0 no 

Hence 


ees 2s Soe ee 


ets. lias 


1 


ee 
0 3) 


and the problem is solved. 


Theorem (2.1) shows that it is immaterial whether we define the definite 
integral in terms of upper and lower sums (as is done in this book) or as the 
limit of Riemann sums. Hence there is no logical necessity for introducing 
the latter at all. However, a striking illustration of the practical use of Rie- 
mann sums arises in studying the problem of evaluating definite integrals by 
numerical methods. 

In spite of the variety of techniques which exist for finding antiderivatives 
and the existence of tables of indefinite integrals,there are still many functions 
for which we cannot find an antiderivative. More often than not, the only 
way of computing J, f(x) dx is by numerical approximation. However, the 
increasing availability of high-speed computers has placed these methods in 
an entirely new light. Evaluating - f(x) dx by numerical approximation is no 
longer to be regarded as a last resort to be used only if all else fails. It is an 
interesting, instructive, and simple task to write a machine program to do the 
job, and the hundreds, thousands, or even millions of arithmetic operations 
which may be needed to obtain the answer to a desired accuracy can be 
performed by a machine in a matter of seconds or minutes. 
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One of the simplest and best of the techniques of numerical integration 
is the Trapezoid Rule, which we now describe. Suppose that the function fis 
integrable over the interval [a, b]. For every positive integer n, let o,, be the 
partition which subdivides [a, b] into n subintervals of equal length A. Thus 
6, =m, os 5 Se a 


Oe eae 
ee 


h 


= Xj — Xj_1, ea 1,...,H. 
It is convenient to set 


Sere we ie a ene 


Then the Riemann sum obtained by choosing x7 to be the left endpoint of 
each subinterval [x;_1, x;], i.e., by choosing x* = x;_1, is given by 


S fi) — Xia) = = f(%iayh = hy Yi-t. 


Similarly, the Riemann sum obtained by choosing x* to be the right endpoint, 
ie., by taking x*¥ = x,, is 


S Six; he) = DS f(xph aa ee Lae 


The approximation to { a f prescribed by the Trapezoid Rule, which we denote 
by T;,, is by definition the average of these two Riemann sums. Thus 


n> post hd Vi 
ree var 
So rhe 


=3(Sat Lx): (1,9 


: ¥;, 


The last expression above can be simplified by observing that 
n n n—l 
dD Vat Dove = vo + 2D) vit yn. 
t=1 t=1 t=1 

Hence 


n—l 
Tn = h(Syo + D> vi + Fyn). (2) 


t=1 
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We shall express the fact that T,, is an approximation to the integral J % f by 
writing a f = T,. The Trapezoid Rule then appears as the formula 


-b 
(2.2) | Be Pi ee nt + Bn). 


Why is this formula called the Trapezoid Rule? Suppose that f(x) = 0 
for every x in [a, b], and observe that 


b Ly ro rn 
[r-] r+ eae ae 


The area of the shaded trapezoid shown in Figure 5 is an approximation to 
Se f. This trapezoid has bases y;_, = f(x;-1) and y; = f(x:) and altitude 


h 
h. By a well-known formula, its area is therefore equal to 5 (yi—1 + iy): 


Xj-1 Xj X-axXIS 


Figure 5 


The integral | f is therefore approximated by the sum of the areas of the 
trapezoids, which is equal to 


> é Gah) = Ay yat >, vs) 
iat faa iat 


Equation (1) shows that this number is equal to T,,. 
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EXAMPLE 2. Using the Trapezoid Rule, find an approximate value for 
= (x* + x*) dx. We shall subdivide the interval [—1, 1] into n = 8 sub- 
intervals each of length h = 4. The relevant numbers are compiled in Table 1. 


TABLE | 
eee ee ee ee 
04 3 1—1=0 
e202 5 =: 
21 2 ae er = ee 
S| a 45 ot 
4 0 0+0=0 
5| tla+a-& 
OC ee ge eet 
7| $\%+#-8 
8 l l1+1=2 
Hence 
Ts = a(3¥o0 + yi +++ + yz + Sys) 
_1(g, 948 +a 0t eee | 
4 64 
a yl eee ae 
eee oes 7 ees © ee 


With the Fundamental Theorem of Calculus it is easy in this case to compute 
the integral exactly. We get 


- : : r Ps 
: | +x)de= (44%) 


The error obtained using the Trapezoid Rule is, therefore, 


—l 


In Example 2, the error can be reduced by taking a larger value of 1, or, 
= a ; 
equivalently, a smaller value of A = = . Indeed, it is easy to show that in 
n 


any application of the Trapezoid Rule the error approaches zero as n increases 
without bound (or as / approaches zero). That is, we have the theorem 


b 
(2.3) lim T,, = | F(x) dx. 
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Proof. It follows from equation (1) that 
1 n n 
Tr = (AD ee ae es ya): 


Both Ad vy=1 3-1 and h>-;~1 y; are Riemann sums for f. Hence, by The- 
orem (2.1), both approach the integral asn — «©. Thus 


and so : 
b b b 
hm. Ty = 1) | f(x) dx + | f(x) ax| = | fixe, 
and the proof is complete. 


As a practical aid to computation, Theorem (2.3) is actually of little 
value. What is needed instead is a method of estimating the error, which is 


equal to 
b 
[i s-n 


for a particular choice of n used in a particular application of the Trapezoid 
Rule. For this purpose the following theorem is useful. 


5 


(2.4) If the second derivative f’’ is continuous at every point of [a, b], then there 
exists a number c such thata < c < band 


b 
S26 4h, 2 
[ r-n-*58rrow., 


An outline of a proof of this theorem can be found in J. M. H. Olmsted, 
Advanced Calculus, Appleton-Century-Crofts, 1961, pages 118 and 119. 

To see how this theorem can be used, consider Example 2, in which 
f(x) = x? + x? and in which the interval of integration is [—1, 1]. In this 
case f’’(x) = 6x + 2, from which it is easy to see that 


T ie as for every x in [—1, 1]. 
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Applying (2.4), we have 


[ots Vide = Te eae fC. 


Hence the error satisfies 


1 
[ ot+xax— 7) -arror 
—l 
<4-8-h° = Sh’. 
It follows that by halving h, we could have quartered the error. If we were to 


h 
replace h by 10° which would be no harder with a machine, we would divide 


the error by 100. 


PROBLEMS 


1. For each of the following limits, find a function f(x) such that the limit is equal 
1 Sills 
to J, f(x) dx. Evaluate the limit. 


ee eee 


n—00 "C 

6) tim Se Ce 
n—0 ne 

ey te VitntvV2+n4+V34n4++::+Vntn 
aoe n3/2 


r 1 1 1 1 
(d) im = ( Sa ee eee i 
naw /n VY1+n J/2+n J/3+n s/n +n 


2. Prove that 


1 
> in? = tim (Et gt : +1). 


(b) r= lim S/T + Ve Bt $V WP). 


3 n 
(c) | (x’ + 1)dx = lim 55 tn + Zin + DF}, 


1 J veel ok & 1 
@ § = lim (+ iets Soa 
5 nat = Tt aie /4n — 
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3. Use the Trapezoid Rule with n = 4 to compute approximations to the following 
integrals. In (a), (b), (c), and (d), compare the approximation obtained with the 
true value. 


1 1 
2 dx 
(a) ; Ge =P Thar (f) | fac 
2 
(b) (x° + 1) dx (g) | e* dx 
0 0 
3 1 
ted 
1 : sin x 
(e) aes dx Gj) | : dx 


4. Show geometrically, without appealing to Theorem (2.4), that the approximation 
T,, obtained with the Trapezoid Rule has the following properties. 


(a) If f(x) > 0 for every x in [a, b], then T, > Jf. 
(ob) 1i f(x) <0 for every x im fa, 6], then.T, < Me i 


5. For each of the following integrals, use Theorem (2.4) as the basis for finding 
the smallest integer n such that the error |{, f — T,| in applying the Trapezoid 


; oe =5 
Rule is less than ) > (il) a0 0.000 and (i) 10 
4 2 
1 1 
(a) | — dx (d) { — ax 
1 6x? a.” 
1 
3 
(b) | (8x — 5x ++ 3) dx (e) | eo 
2 1 
(c) | x + 1) dx (f) | e* dx. 
—l 0 


6. (For those who have access to a high-speed digital computer and know how to 
use it.) Compute the Trapezoid approximation 7, to each of the following in- 
tegrals. 


ui 
1 
(a) | faa for n = 10, 100, and 1000. 
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(b) eae sre ae) for successive values a n = 10,-100, 1000,:. . ., untilthe 
1+ 
error is less than 10 — 


(c) | /1 + x3 dx, for n = 5, 50, and 500. 
0 


(d) | a de, for n = 2, 4, 8, 16, and 100. 
0 


(e) | e dx, for n. = 10,'100, and 1000. 
0 


3. Numerical Approximations (Continued). Two additional methods of 
integration by numerical approximations, which we shall describe in this 
section, are the Midpoint Rule and Simpson’s Rule. 

In the Midpoint Rule the a diok te pecan to the integral f fis a Riemann 
sum >-7_, f(x*)(x; — x;_1) in which each x* is taken to be the midpoint of 
the subinterval [x;_1, x;]. In more detail: Let f be a function which is inte- 


grable over [a,b]. For every positive integer n, let h = 7a , and let 
O, = {x0,...+,5 Xn} be the partition deminer oy 
x; = a-+ih, ( 
As a result, it follows that 
x; — Xia = &, 2 eo 2 
If we take x* to be the midpoint of the subinterval [x;_1, x;], then 


ee * X;1+%x; ; 
x =F f= Looe 


Zz 


The Riemann sum used as the approximation to the integral in the Midpoint 
Rule will be denoted by M,,. It is given by 


M, = > FPG: — xi) = DS f(BE2) : 


In studying the Trapezoid Rule, we found it convenient to use the abbrevia- 
tion y; = f(x,), fori = 0,...,”. By analogy, we shall here let 


ran = f(Zt*). i 2 res 
Then 


M,, = >> Je = A(Q1;2 a 3/2 = stale Yn—1/2)s 
I=1 : 
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and we express the Midpoint Rule for numerical integration by the formula 


b 
(3.1) | 7 ae M,, — h(y1/2 + Y3/2 +eeet Vu~i/2). 


a (x) > 0 for every x in [a, b], the Midpoint Rule approximates the 
integral [ f, which is the area under the curve, by a sum of areas of rectangles, 


as illustrated in Figure 6. 


Figure 6 


An alternative to approximating the integral by a Riemann sum is to 
use straight-line segments which are tangent to the curve y = f(x) at the 
midpoints of the subintervals. An example is shown in Figure 7, in which 


y =f (x) 


Figure 7 
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iv jf is approximated by the sum of the areas of the three shaded trapezoids. 
This method yields the so-called Tangent Formula. It turns out, however, 
that the Tangent Formula is the same as the Midpoint Rule. The reason can 
be seen by looking at Figure 8, in which the area of the shaded trapezoid 
with one side tangent to the curve is the /th term in the approximating sum 


Riss x; x-axis 


Figure 8 


used in the Tangent Formula. The area of this trapezoid is equal to 
h 
a + y’’), where y’ and y” are the lengths of the bases. However, by 


elementary geometry the trapezoid can be seen to have the same area as the 
rectangle with base [x;_1, x;] and altitude y;_1/2. The area of the rectangle 
ish: Vi-—1]25 and so , 
SO +") = he Vinay 
(Incidentally, note that this equation is true regardless of whether y’, y”, 
and y;_1/2 are positive, negative, or zero.) Since the product h- y;_1/2 1s 
the 7th term in the midpoint approximation M,, we conclude that the Tangent 
Formula and the Midpoint Rule, although differently motivated, are in fact 
the same. 


EXAMPLE 1. Approximate [ - (x? + x*) dx using the Midpoint Rule. This 
is the same integral which we evaluated in Section 2 by the Trapezoid Rule. 
To compare the two methods, we shall again take n = 8 and 


Re eS 


8 4 


Since, for an arbitrary interval [a, b], we have 


x; = at ih, Pa a, 
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it follows that 
* Ma tx _ (a+ G— Al+ @+t ih) 
- 


xi, = 9) 


a+ (Gi — $)h. 


In addition, since y;_1)2 = f(x), we have a pair of useful formulas: 


i =at(i-— dh 
lr, Se 


= f(a+ (i— ¥)h) 


x= -1+(G-}4 => 


(1-2) «flict 8(2i — 9)' + Qi - 9° 


Yi-1/2 = 83 


(i 087i —1) 


Table 2 contains the numbers for the computation of Mg. 


TABLE 2 


_ @i - 9) Qi = 1) 


l siya or ae eae 
49 

| yin = B53 
D528 75 

2 2) pee eee 79 
as 45 

3 Pre aerate 
7 

4 Vie = 83 
9 

5 Vaio > Q3 
9-11 99 

6 Jupp = oe = Q3 

5 eo GS 723) 

Jiu = oS = 33 

49-15 439 
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Hence we obtain 


M; = 4(/1/2 ioe hy Vis /2) 


= pI +S +45 +7494 99 + 325 + 735) 
_ 1344 _ 2i 
ee 


as an approximation to the integral ie (x? + x’) dx. The value obtained 
earlier with the Trapezoid Rule was 7; = +¢. Since the true value is given by 


1 
| Gat wae 4, 
—1 
it follows that the error using the Midpoint Rule is equal to 


B- Bl = de 


This is one half the error obtained using the Trapezoid Rule with the same 
value of A. 


In any application of the Midpoint Rule, the error | : f — M,,| can be 
made arbitrarily small by taking n sufficiently large. That is, we assert that 


(3.2) 


This theorem is easier to prove than the corresponding one for the 
Trapezoid Rule because every approximation M,, is, as it stands, a Riemann 
sum of f relative to the partition a, of [a, b]. Since ||a,,|| > 0asn > », itis 
a direct corollary of the fundamental theorem on the limit of Riemann sums 
[(2.1), page 414] that lim M,, = ff 


A means of estimating the error a f — M,]| in a particular application 
of the Midpoint Rule is provided by the following theorem: 


(3.3) If the second derivative f"’ is continuous at every point of [a, b], then there 
exists a number c such thata < c < band 


b 
b—a ” 2 
[ r=. + 5S rron. 
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As with the analogous theorem for the Trapezoid Rule [(2.4), page 419], 
this theorem can be proved by first reducing it to the casen = 1. A discussion 
of the error term can be found in R. Courant and F. John, /ntroduction to 
Calculus and Analysis, Volume IJ, Interscience Publishers (Wiley), 1965, 
pages 486 and 487. 

Theorem (3.3) can be used to obtain an upper bound on the error 
if f' — M,| provided the second derivative f’’ is bounded on the interval 
[a, b]. That is, if there exists a real number K such that 


Wie? 3 for every x in [a, b], 


then, in particular | f’’(c)| < K, and so 


ee! 


If"o)| : 


(b= a)K 2 
S 24 - 


For the one integral we computed by both methods, the Midpoint Rule 
gave a better approximation than the Trapezoid Rule by a factor of 2. Com- 
parison of Theorem (3.3) with (2.4) shows that in general this ratio can be 
expected. 

Geometrically, the different methods of numerical integration described 
thus far in this and the preceding section are all based on approximating the 
area under a curve by a sum of areas of rectan gles or trapezoids. Analytically, 
in each method the approximation to , f has been obtained by replacing f 
over every subinterval by a linear function. In Simpson’s Rule, however, we 
shall replace f over each subinterval by a quadratic polynomial Ax? + 
Bx + C. The corresponding area problem is the simple one of finding the 
area under a parabola (or a straight line, if 4 = 0). For most integrals, 
Simpson’s Rule gives much greater accuracy for the same value of A. 

Let f be a function which is integrable over the interval [a,b]. The 
procedure differs from the others in that we consider only partitions of 
[a, b] into an even number of subintervals. Thus for an arbitrary even 


b—a 
integer n > 0, we set h = ———, and let 
n 


a + ih, 


pr FU}, cn 0... .,8 


Xi 
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Since n is even, there is an integral number of ‘“‘double” intervals [x2;_», x2;], 


oe eae : , as illustrated in Figure 9, and 


Figure 9 


27 


b n/2 
| {@jde= > f(x) dx. 


I=l1 %2i—2 


There exists one and only one polynomial g,;(x) = A;x? + B;x + C; of 
degree less than three whose graph passes through the three points 
(X23—25 Voi—2)5 (Xoi—1, Voi—1), and (X2;, Yo;) (see Figure 10). Over each double 
interval [x2;_2, X2;] we shall approximate the integral of f by the integral of q;. 
Let us assume for the moment, and later prove, the fact that 


| qi(x) dx = Ons + 4737-1 + Vaz). ( 1 ) 


21-2 
y-axis 


(X2;-1 » 2-1) 


(X2j-2 5 V2i-2 ) 


(X2i, yr) 


y = qi(x) 


Figure 10 
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The sum of these integrals, which we shall denote S,, is the approximation to 
f f prescribed by Simpson’s Rule. Hence 
A n /2 
Ss, = 3 val (Voie + 4yor_1 + yz). 


If this sum is expanded, note the pattern of the coefficient of the y,’s: If i is 
odd, the coefficient of y; is 4. If 7 is even, the coefficient is 2, with the excep- 
tion of yo and y,, each of which has coefficient 1. Thus Simpson’s Rule is 
expressed in the formula 


b 
I ae Sn = 5 (0 + Ayn + ye + ys + 2 
qeoeee oi) ae a AVn—1 — Yn) 


(3.4) 


We now prove equation (1). The algebra is significantly simpler if we 
write g,;(x) in terms of x — x»;_1 instead of x (see Figure 10). Hence we let 


Ei) antag) Bi — Noa) i: (2) 


The integral ea _, U(x) dx may be computed by substituting u = x — Xe;_1 


and using the Change of Variable Theorem for Definite Integrals (see 
page 215). Since X;. — Xy—, = —A and xy; — x, = A, the result is 


X27 h 
| qi(x) dx = | (au + Bu + 7;) du 
zx —h 


21-2 
< a jue Bu 
= 3 2 : 


= 7 (aah + 67). (39 


h 


Setting first x = x»;_1 in equation (2), we obtain 
ee 1 Se a 8 = 1%. 
Next we let x = x2;_. and xX = X:; to get 
Juss = Gi(Xo;_2) ag” re Bdltrtk: Va, 
and 
Yo = QilXa) = ah? + Bh + 7%. 


Adding, we obtain 
Vue + Py = 2a,h? “> 2Y;. 
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Since Y; = yo;_1, it follows that 
ash? + 6Y; = Yor—o + 4yaz_1 + Ya. 


Substituting this result in (3) yields the desired result (1), and the derivation of 
Simpson’s Rule is complete. 


I 
EXAMPLE 2. Using n = 4, find an approximate value of | 1 dx by 
0 


1-0 1 
simpsons Rite, We have = ————— = 


x? 


4 a" 
a ees eee 
iF ee ~ = ie | ae ee ee 8 
eo a 
oa 


Table 3 contains the numbers necessary for the computation. 


Hence 


 ( + 4 Ae Se 


= 64 32 64 16 
srl Fy F gore ss Foe 


: = 0. 785992...5 


Ns 
! 


We know that 


1 
] * and 
| tan dx ="arctn & ee te ke 


Hence the error | - f — S,| is approximately 0.000006. 
Just as with the other two methods of integration by numerical ap- 
proximation, the error |{, f — S,,| can be made arbitrarily small by taking 1 


sufficiently large. That is, we have the following theorem, which we state 
without proof. 


(3.5 ) 
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In addition, the next theorem can be used to estimate the error in a 
particular application of Simpson’s Rule. 


(3.6) If the fourth derivative f ” is continuous at every point of [a, b], then there 
exists a number c such thata < c < band 


b 
bg es. 
| fs. fn. 


For an outline of a proof, see J. M. H. Olmsted, Advanced Calculus, 
Appleton-Century-Crofts, 1961, page 119. 
The fourth derivative of every cubic polynomial is identically zero, for if 


F(x) = a3x® + aox? + ax + ao, 


then f(x) = 0. It is therefore a rather surprising corollary of Theorem(3.6) 
that Simpson’s Rule will always give the exact value of the integral when 
applied to any polynomial of degree less than 4. 


PROBLEMS 


1. Use the Midpoint Rule with n = 4 to compute approximations to the following 
integrals. In (a), (b), (c), (d), and (e) compare the result obtained with the true 


value. 
1 2 
(a) | i 4 Lax (e) | /1 + x dx 
0 0 
3 27 
(b) | (6x — 5)dx (f) | sin’x dx 
at 0 
3 1 
(c) | 2 dx (g) e* dx 
1x 0 
3 ; 1 


2. (a) through (h). Compare M,, the Midpoint Approximation computed in 
Problem 1, to 74, the corresponding Trapezoid Approximation. 

3. (a) through (h). Use Simpson’s Rule with n = 4 to compute an approximation 
to the corresponding definite integral in Problem 1. 

4. Show geometrically that, if the graph of f is concave up at every point of the 
interval [a, b], then the Midpoint Approximation is too small and the Trape- 
zoid Approximation is too big; i.e., 


b 
m.< | | 2 ae ae 
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5. Do Problem 4 analytically by using the remainder formulas (2.4) and (3.3). 

6. Show that Simpson’s Approximation is the weighted average of the Trapezoid 
Approximation and the Midpoint Approximation. Specifically, for any even 
positive integer n = 2m, show that 


Sn = 3Tm + &Mn. 


7. Prove Theorem (3.5), i.e., if fis integrable over [a, b], then 


by showing that it is a direct corollary of the result of Problem 6 and the two 
b b 
corresponding theorems, lim 7, = Jf, fandlim M, = J, f. 


8. For each of the following integrals and each of the three methods of numerical 
integration (Trapezoid Rule, Midpoint Rule, and Simpson’s Rule), find the 
smallest integer m such that the error obtained is less than 10~4. As the basis for 
finding n, use Theorems (2.4) (3.3), and (3.6). 


4 2 2 
(a) | (ss + =) dx (b) | er dx. 
9. This problem is analogous to Problem 6. Show that, for any positive integer n, 
Ton = $1 n + 9Mn. 
10. Suppose that the graph of f is concave up at every point of the interval [a, 5]. 
(a) Using the results of Problems 4 and 9, show that 


b 
Pon <3 T,, a T2n) %, | i= Tin, 


for every positive integer n. 


(b) Hence show that the error |f E f — T2,| in the Trapezoid Approximation 


satisfies 
b 
| i> Pon 


(c) Show that (b) also holds if the graph of f is concave down at every point of 
[a, 5]. 


< (1%, — Fal: 


4. Volume. In this section we shall use Riemann sums to derive several 
integral formulas for the volume of solids. The concept of volume is a familiar 
one, and we shall not give a mathematical definition of it. As a result, the 


SEC. 4] VOLUME 433 


formulas will be derived from properties of volume which seem intuitively 
natural and which we shall assume. By a solid we mean a subset of three- 
dimensional space. 

A subset Q of three-dimensional space is said to be bounded if there 
exists a real number k such that, for any two points p and q in Q, the straight- 
line distance in space between p and q is less than k. Alternatively, Q is 
bounded if and only if there exists a sphere in space which contains Q in its 
interior. In finding volumes of solids, we shall restrict ourselves to bounded 
subsets of three-dimensional space. 

Let Q be a bounded solid, such as the one shown in Figure 11. We 
choose an arbitrary straight line in space for a coordinate axis. Any point 


Figure 11 


on the line may be chosen for the origin and either direction as the direction of 
increasing numbers. The scale on the line must agree with the scale of dis- 
tance in space. That is, if two points on the line correspond to the numbers x 
and y, then |x — y| must equal the straight-line distance in space between 
the two points. 

For every number x, let P, be the plane figure which is the intersection of 
the solid Q and the plane perpendicular to the coordinate axis at x (see 
Figure 11). The second assumption we make about Q is that every set P, 
has a well-defined area. We then define a cross-sectional area function A by 
setting 

A(x) = aredt.). 


One consequence of the fact that Q is bounded is that there exists an interval 
[a, b] such that, for every x outside of [a, b], the set P, is empty and so 
A(x) = 0. Another consequence is that the function A is bounded on 
[a, 5]. 

Let Ws new consider a partion o = {X,.:.,x,} -of the interval 
[a, b] which satisfies the inequalities 


Oe ym oS to Xe = D: 
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In each subinterval [x;_1, x,], we choose an arbitrary number x*. The pro- 
duct A(x#)(x; — x;—1) is equal to the volume of a right cylindrical slab with 
constant cross-sectional area A(x*) and thickness x; — x;_1. If x; — x;_1 
is small, then we should expect A(x*)(x; — x;_1) to be a good approximation 
to the volume of the slice of Q which lies between the parallel planes at 


Nias | x; X-axis 


x; 
Figure 12 


X;-1 and x; (see Figure 12). Hence if the mesh of the partition o is small, 
then the Riemann sum 


SY) A(X xi — Xi-1) 

ig’ 
will be a good approximation to the volume of Q. Moreover, the smaller the 
mesh, the better the approximation ought to be. Therefore, we shall assume 
that if the volume of Q is defined, then it is given by 


vol(Q) = lim - Ae Wee a5). 


II7l> 


It follows immediately from the integrability criterion, Theorem (2.1), page 
414, that 


b 
(4.1) vol(Q) = | A(x) dx. 


This is a basic integral formula for volume. 
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EXAMPLE 1. Find the volume V of a pyramid of height / with a square base of 
length a on a side. The pyramid is shown in Figure 13(a). We choose a 
vertical x-axis with origin at the apex and which cuts the center of the base at 
x = h. The cross-sectional area A(x) of the intersection of the pyramid 


origin 


y 


iti 


| a : 2 aes e 
f - | yf direction of 
increasing x 
(a) (5) 
Figure 13 


and the plane perpendicular to the coordinate axis at x is easily seen from the 
side view [Figure 13(b)]. Since corresponding parts of similar triangles are 
proportional, it follows that 


Hence 


and so 


ee a See ee 2 

ee 3 4 h. 
This is the well-known result that the volume of a pyramid is equal to one 
third the product of the height and the area of the base. 


EXAMPLE 2. The axes of two right circular cylinders P and Q of equal radii a 
intersect at right angles as shown in Figure 14(a). Find the volume of the 
intersection PM Q. We choose an x-axis perpendicular to the axes of both 
cylinders and which passes through their point of intersection. This point 
is chosen for the origin, and the direction of increasing x is upward. 
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Figure 14(b) helps one to see that the cross sections of P MN Q perpendicular 
to the x-axis are squares. From the end-on view in Figure 14(c), it is ap- 


(5) (c) 
Figure 14 


parent that the cross section at x is a square with an edge of length 
2,/a* — x?. Hence 


AQ) = Ose = 4 = 4a — =). er 
By the integral formula for volume, therefore, 


vol(P N Q) = | A(x) dx = 4 | ie = 2 ee 


Since a? — x? is an even function, the integral from —a to a is twice the 
integral from 0 to a. Thus 


| 
(ore) 
Sine! 

yen 
mi 

* 
ae 
oa 


vol(P A Q) 


| 
fore) 
ea 
Q 
bee 
| 
Uo | ot. 
a ee 
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Consider a plane in three-dimensional space containing a region R and 
a line L. The solid generated by rotating R in space about L is called a solid 
of revolution. Among the solids of revolution are those described by rotating 
a portion of the graph of a function in the xy-plane about the x-axis (or the 
y-axis). We shall give an integral formula for the volumes of these solids, 
which is a special case of (4.1). Let f be a function integrable over the closed 
interval [a, b], and consider the solid of revolution Q swept out by rotating 
about the x-axis the region bounded by the graph of f, the x-axis, and the 


x-axis 


(b) 


Figure 15 


lines x = aand x = b. Sucha region is illustrated in Figure 15(a), and the 
corresponding solid in Figure 15(b). For every x in [a, 5], the cross section 
of Q perpendicular to the x-axis at x is a circular disk, of radius | f(x)|. Hence 


A(x) = a f(~))? = aLfONP. 


By formula (4.1), therefore, the volume of the solid of revolution Q is given 
by 


~b 
(4.2) vol(Q) = a | Lf (x) dx. 


EXAMPLE 3. Let R be the region in the upper half-plane bounded by the 
parabola y? = 4x, the x-axis, and the line x = 4. Find the volume of Q, the 
solid of revolution obtained by rotating this region about the x-axis. The 
region and the solid are drawn in Figure 16. That part of the parabola in the 
upper half-plane is the set of all points (x, y) such that y? = 4x and y > 0. 
This set is the graph of the equation y = »/4x, and we therefore take 
f(x) = 4x in formula (4.2), obtaining 


| ireoytax = x | 4x dx 


= 
== Ag > 


vol(Q) 


4 


OT. 


0 
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(a) (5) 


Figure 16 


EXAMPLE 4. Let R be the region in Example 3, and let S be the region bounded 
by the parabola y? = 4x, the y-axis, and the line y = 4. Both regions are 
shown in Figure 16(a). Find the volumes of the two solids of revolution Q, 
and Q», obtained by rotating S and R, respectively, about the y-axis. The 
two solids are illustrated in Figure 17. The easiest way to find the volume of 
Q,; is to use the counterpart of formula (4.2) for functions of y. The equation 
2 
y? = 4x is equivalent to the equation x = a ; sii the parabola is therefore 


the graph of the function of y defined by f(y) = —. By symmetry therefore, 


re 


vol(Qs) = 7 | Lfoydy = 78 dy 


x-axis 


(a) 


Figure 17 
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The union of Q, and Q, isa right circular cylinder of radius 4 and height 
4. Hence 
vol(Q; U Qe) = (714): 4 = 641. 


It is obvious that 


vol(Q; U Q2) = vol(Q:1) + vol(Q:). 


Hence 
vol(Q2) = vol(Q1 U Q2) — vol(Q1) 
647 2567 
= 647 — ee 


Another way of computing the volumes of certain solids of revolution 
is the method of cylindrical shells. It may be used, for example, to find the 
volume of the solid Q. in Figure 17(b) directly, and it constitutes another 
interesting application of the integral as a limit of Riemann sums. Let f bea 
function which is nonnegative and continuous at every point of a closed 
interval [a, b], where a > 0. Let R be the region bounded by the graph of f, 
the x-axis, and the lines x = aand x = b. We shall derive a formula for the 
volume of P, the solid of revolution obtained by revolving the region R about 
the y-axis (see Figure 18). 


X-axIS 
(a) (0) 
Figure 18 
Consider a partition @ = {xo,..., Xn} of the interval [a, b] such that 
ay ay Se ey SS 
and choose numbers x*,...,x* such that x;1 < xf <x; for each 


i= 1,...,n. Then r(x? — x%_,) is the area of the shaded annulus shown 
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in Figure 19(a), and af(x)(x? — x?_1) is the volume of the cylindrical shell 
of height f(x?) and thickness x; — x;_1 shown in Figure 19(b). The essential 
idea in the derivation is that, if x; — x,;_1 is small, then the volume of this 


y-axis 


(a) (5) 
Figure 19 


shell should be a good approximation to the volume of that part of the solid 
P that lies between the two cylinders of radii x;_1 and x;.. Hence if the mesh 
of the partition o is small, then the sum of the volumes of the shells, i.e., the 
sum 


yee = a 


t=1 


should be a good approximation to the volume of P. Specifically, we shall 
assume that 


vol(P) = dim 3 ep ae a. éf) 


|a||0 eee | 


On the basis of this assumption we now prove the theorem 


b 
(4.3 ) bon) = om | xfer 


Proof. Since each x; may be any number in the subinterval [x;_1, x;], let us for 
convenience choose it to be the midpoint. This means that x* = ae 
or, equivalently, that x; + x;_1; = 2x*. Hence 

ee Sea Xa) 


* 
2x; (X%: — Xi), 


2 2 
Xj — Xj-1 


sEC. 4] VOLUME | AA] 


and it follows that 


> ef — Ha) = D5 uxt £06 ers — xi). 
i=l i=1 


By our assumption (1), the left side of this equation approaches vo/(P) as a 
limit as the mesh ||o|| approaches zero. The right side is a Riemann sum for 
the continuous function 27xf(x) and therefore approaches the integral 


J 2nrxf(x) dx as a limit as ||o|| approaches zero. This completes the proof. 


The region R in Figure 16(a), when rotated about the y-axis, generates 
the solid of revolution Q, illustrated in Figure 17(b). Let us compute the 
volume of Q» by the method of cylindrical shells. The function f in the for- 
mula is in this case the one defined by f(x) = 4x = 2x'”. In addition, 
a = Oand b = 4. We therefore obtain 


4 
vol(Q2) ied x 2x""dx 
0 


4 
x3? 
0 


4 
tr | x ?dx = Ar: 
0 


dn 8-32 = “OT, 


onpo 


which agrees with the value obtained in Example 4. 


f(x) 
Sar 
= ies 


Figure 20 


It was pointed out in Section 6 of Chapter 4 that the dx which appears in 
{f (x) dx can be legitimately regarded as a differential. Moreover, it was 
remarked in Section 6 of Chapter 2 that the traditional attitude toward a 
differential is that it represents an infinitesimally small quantity. These 
ideas are a good aid to the imagination here, too. Thus, as the meshes of the 
subdivisions in the Riemann sums 771 f(x7)(x; — x;1) become in- 
finitesimally small, the differences x; — x;_1 become dx and the summation 
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> becomes the integral [. Consider the following rough-and-ready derivation 
of the formula used in the method of cylindrical shells. We imagine a shell 
of infinitesimal thickness at each x in the interval [a,b]. The radius is x, 
the thickness dx, and the height f(x) (see Figure 20). The circumference of 
the shell is 27x, and the area of the edge is the product of the circumference 
and the thickness, 27x dx. Multiplying by the height, we get the infinitesimal 
volume 2rxf(x)dx. The total volume is obtained by adding all the in- 
finitesimal volumes. We get > 27xf(x) dx or, actually, 


b b 
| 2rx f(x) dx = anf x LTRAX 


for the answer. 

The derivation of integral formulas by this process of ‘‘summing 
infinitesimals’’ is extremely useful both as a guide to memory and in helping 
one to discover the right formula in the first place. Of course, any such heu- 
ristic approach is justified mathematically only if it can be supplemented by 
careful analysis. 


PROBLEMS 


1. A solid Q has a flat base which is the region in the plane bounded by the 
parabola y” = x and the line x = 4. Each cross-section perpendicular to the 
x-axis 1S a Square with one edge lying in the base. Find the volume of Q. 

2. The solid P has the same base as Q in Problem 1, but each cross-section per- 
pendicular to the x-axis is a semicircular disk with diameter lying in the base. 
Compute vo/(P). 

3. A tetrahedron is a solid with four vertices and four flat triangular faces. Let T 
be a tetrahedron which has three mutually perpendicular edges of lengths 3, 4, 
and 10 meeting at a vertex. Draw a picture of T and compute its volume. 

4. The graph of the function f(x) = a2 — x? is a semicircle of radius a. Use 
this function and an integral formula for the volume of a solid of revolution to 
compute the volume of a sphere of radius a. 

5. Find the volume of the ellipsoid of revolution obtained by rotating about the 


x-axis the region bounded by the ellipse = a = an 
. 5 


6. Sketch and find the volume of each of the solids of revolution obtained by 
rotating about the x-axis the region bounded by the indicated curves and lines. 


(a) y= SP — 1x = 1 x= 2 and ihe 
(b) y = $x, x = 8, and the x-axis. 

(c) “pets 2x x eS ey = Oa =. 
1 


2 


(d) y= —,x = 1, x = 2, and the x-axis. 


bas 


1 — x? and the x-axis. 


ery 
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a 
8. 


10. 


ff. 


EZ. 


13. 


Find the volume of a right circular cone of height / and with a base of radius a. 

(a) Find the volume of the solid of revolution obtained by rotating about the 
y-axis the region bounded by the x-axis, and the graphs of y = x? — 1 
and.y = 3. 


(b) Using (a), find the volume generated by rotating the region in Problem 6(a) 
about the y-axis. (Use Example 4 as a model.) 


. Using the method of cylindrical shells, find the volume of the solid of revolution 


obtained by rotating each of the regions in Problem 6 about the y-axis. 
Sketch the region R in the plane which is bounded by the parabola (y — 1)” = x, 
the line y = 2, and the x-axis and y-axis. Find the volume of the solid of revolu- 
tion obtained by rotating R about the x-axis, using 


(a) formula (4.2) twice, ie., wf. “ y2dx once with y — 1 = Vx and again with 
yo1l= -—vx. 

(b) the counterpart of formula (4.3), i.e., the method of cylindrical shells, for 
functions of y. 


Since (x — fh)? + y? = a? is an equation of the circle with radius a and center 
at (h, 0), it follows by solving for y in terms of x that the graph of the function 
f(x) = Va — (x — A)? is a semicircle. 


(a) Assuming that / > a and using the method of cylindrical shells, write a 
definite integral for the volume of the solid torus (doughnut) with radii 
h and a. 


(b) Evaluate the integral in (a) by making the change of variable ypoxrx-h 
[see Theorem (6.6), page 215], and using the fact that fa V@ — y? dy = 


Ta’ a 
(area of a semicircle). 


In a solid mass of material, the infinitesimal mass dm of an infinitesimal amount 
of volume dv located at an arbitrary point is given by 


ani = p ae, 


where p is the density of the material at that point. 

Consider a cylindrical container of radius a filled to a depth / with a liquid 
whose density is greater at the bottom and less at the top. Specifically, at a point 
a distance x below the surface the density is given by p = 2 + x. What is the 
total mass of liquid in the container ? 

Same as Problem 12, but this time the container is a right circular cone (apex 
at the bottom) of height / and base of radius a which is filled to the top. 


5. Work. The concept in physics of the work done by a force acting on an 
object as it moves a given distance provides another important application of 
the definite integral. 


Throughout this section we shall consider only those situations in which 


the object moves in a straight line L, and in which the direction of the force 
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Fis also along L. Mathematically, F is a function, which may or may not be 
constant. We shall assume that L is a coordinate axis and that, for every 
number x on L, the value of the force F at x is equal to F(x). The sign con- 
vention will be as follows: F(x) > O means that the direction of the force at 
x 1s in the direction of increasing numbers on L, and F(x) < O means that 
the direction of the force is in the direction of decreasing numbers. 

We first consider the special case in which the force F is constant as the 
object moves along L from a to b. Thus F(x) = k for all x such that 
a <x <b. Then the work done by the force denoted by W, is defined by the 
simple equation 

W = k(b — a). (1) 


Frequently, we wish to speak of the work done against the force which we 
shall denote by W,,. This is just the negative of W. Hence 


We Ha Le = @. (2) 
EXAMPLE 1. Compute the work done in raising a 10-pound weight a distance 


of 50 feet against the force of gravity. We choose a coordinate axis as shown 
in Figure 21 with the origin at the initial position of the object. The mag- 


Figure 21 


nitude of the force is constant and equal to 10 pounds. Thus |F(x)| = 10. 
By our sign convention, however, F(x) is negative, and so F(x) = —10. 
The work done against the force of gravity in raising the weight is therefore 
given by 


W. = (—F(x))(b — a) = (—(-10))(50 — 0) 
500 foot-pounds. 
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Next, let us consider the problem of defining the work done by a force F 
which is not necessarily constant. We shall assume that the function F is 
integrable over the closed interval having endpoints a and b (it may be that 
a < bor that b < a). Then the work done by the force F as the object moves 
from a to b will be denoted by W(F, a, b) and is defined by 


b 


WW, a,b) = | F(x) dx. 


(3) 


Thus work depends on the function F and the numbers a and 5, and hence is a 
function of these three quantities. As in (1), we frequently abbreviate 
W(F, a, b) as simply W. 

Is this definition of work a reasonable one? The answer is yes only if 
WF, a, b) has the properties which correspond to the physical concept we are 
trying to describe. For example, we should expect that the work done by a 
force in moving an object from a to b plus the work done in moving it from 
b to c should equal the work done in moving it from ato c. This property is 
expressed in the equation 


(5.1) W(F, a, b) + W(F, b,c) = WF, a, ¢), 


which is an immediate corollary of the definition of W(F, a,b) and the 
fundamental additive property of the integral [see Proposition (4.2), page 191]. 
Second, the work done by a greater force acting on an object as it moves from 
a to b in the direction of the force should certainly be larger than the work 
done by a smaller force. This is expressed in the proposition 


(5.2) If Fi(x) < F(x) for every x such thata < x < b, then 
W(Fi, a, b) S W(Fo, a, b). 


This is also simply a restatement of one of the fundamental properties of the 
definite integral [see Proposition (4.3), page 191]. Finally, we note that the 
definition is consistent with the earlier one in equation (1). That is, if the 
force is constant, then the work is simply the product of the constant value 
and the change in position. Thus 


(5.3) If F(x) = k for every x in the closed interval with endpoints a and b, then 
W(F,.a, b) = k(b — a). 


The proof is just the elementary fact that | k dx = k(b — a) [see 
Proposition (4.1), page 191]. 
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We have just shown that work, as we have defined it, has three natural 
and apparently quite basic properties. This suggests that the definition is 
reasonable. Actually, we can conclude much more than that. We shall now 
show that our definition of W(F, a, 6) as a definite integral is the only one 
which has these three properties. That is, we have proved that the definition 
implies the properties, and we shall now prove, conversely, that the properties 
imply the definition. This is such an important fact that we state it as a 
theorem: 


(5.4) THEOREM. Let W be a function which assigns to every function F and 
any interval [a, b] over which F is integrable a real number W(F, a, b) such that 
(5.1), (5.2), and (5.3) hold. Then 


b 
W (F502; b= | F(x) dx. 


Proof. Let F be a function, and [a, 5] an interval over which F is integrable. We 
shall first show that, for every partition o of [a, b], the upper and lower sums, 
U, and L,, satisfy the inequalities 


Le & Wao) = U,. (4) 
To do this, we let ¢ = {xo,..., Xn} and assume the usual ordering: 
io ker sky So See ae 


As we have done in the past, we denote by M; the least upper bound of the 
values of F on the ith subinterval [x;_1, x:;], and by m; the greatest lower 
bound. Then 


m; < F(x) < M,, whenever x;-1 < x < X. 


The two constant functions with values M; and m,, respectively, are certainly 
integrable over the subinterval [x;_1, x;]. Following the common practice 
of denoting a constant function by its value, we know, as a result of (5.2), that 


Wiig teks aed AE eG ee aa 
Using (5.3), we obtain 
Wn, 41, KO) = anes Bi; 


W(M,,. X11, XD) = TR Sa - 
Hence 
DAG Bea) Se pee A 


Adding these inequalities for i = 1,...,n, we get 


y; Maixe Tay 9 Wal, Aiats 25) 5 Ns: 3 
hed ech at 
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it. Consider the spring shown in Figure 22, which is 1 foot long when under 
no tension. A 5-pound load B attached to the end of the spring has stretched 
it to a length of 2 feet (i.e., an additional 1 foot from rest position). How much 
work is done by the restoring force of the spring if the load is raised 4 foot? 


x = 0 (rest position) 


direction of increasing x 


Figure 22 


We choose a vertical x-axis with increasing values of x pointing down, and 
for convenience take the origin to be the rest position. For x > 0, the re- 
storing force F(x) of the spring is upward and, therefore, F(x) < 0. It 
follows by Hooke’s Law that 

F(x) = —kx, 


for some positive number k. To find the constant k, we use the fact that the 
5-pound load B has stretched the spring 1 foot. Hence F(1) = —5, from 
which it follows that —5 = —k-landsok = 5. Thus 


- Pia) = — Sy. 


In raising B a distance of 3 foot, the movement is from x = 1 to x = 3. 


Hence the work W done by the restoring force Fis given by 


1/2 1/2 
w-| Fede = | (—5x) dx 


1 


= 9 

x 

Se 
[sea 2 


1/2 


= 5($ — $) = ¥ foot-pounds. 


The work done against the force F as the object moves from a to b will 
be denoted by W,(F, a, b), or simply by W, as before, and is by definition 
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the negative of the work done by the force F. Thus 


b 


W(F, a, b) = af F(x) dx. (6) 


EXAMPLE 3. Consider the spring in Example 2 loaded as shown in Figure 22. 
How many foot-pounds of work are required to pull the load B down an 
additional 1 foot (i.e., so that the spring is stretched to a total length of 3 feet)? 
The total, or resultant, force F(x) acting at x is the sum of two forces: The 
first is the restoring force of the spring, which we have computed to be —5x, 
and the second is the force of gravity on B, which is equal to 5. (We ignore 
the weight of the spring.) Hence 


fi == 5x 


— 


The work required to pull the load B down an additional 1 foot, 1.e., to move 
from x = 1 to x = 2, will be the work done against the resultant force F. 
Thus 


y= = | Pivjay = oe (5 — 5x) dx 


2 


5x” 
: @ = sx) 


EXAMPLE 4. According to Newton’s Law of Gravitation, two bodies of masses 


= 3 foot-pounds. 


M and mare attracted to each other by a force equal in magnitude to G ai 
where + is the distance between them and G is a universal constant. If the 
earth has mass M, find the work done in projecting a missile of mass m 
radially outward 500 miles from the surface of the earth. Let the center of 
the earth be fixed at the origin of an axis along which the missile is projected 
in the direction of increasing x, as shown in Figure 23. By Newton’s Law, the 
gravitational force F(x) acting on the missile at x is toward the origin and 


M. 
equal in magnitude to G —— . By our sign convention, therefore, 
x 
Mm 
A 
Let a = 4000 miles, the radius of the earth, and let b = 4500. The work 


required to project the missile from a to 6 is equal to the work Wx done 
against the force F in moving from a to b. 


F(x)= -—G 


450 THE DEFINITE INTEGRAL (CONTINUED) [cuap. 8 


missle 


Hence 


b 
a F(x) dx 


b b 
= ae —6 4" ax = Gam | —~ dx 


I 
a 
S 
3 


x x? 
b 
(- ‘) 2s (2 . ;) 
oe aie : as i, 
| l 
eee Ge = au) | 
Suppose that we consider the work required to project the missile from the 


surface of the earth to points successively farther and farther away. We find 
that 


a a (2 a5 ;) 
bis0 bse a b 
_ GMm 
a 


This number can be regarded as the work necessary to carry the missile com- 
pletely out of the earth’s gravitational field. In this example we have, of 
course, ignored the gravitational forces which exist because of the presence 
of other bodies in the universe. 


PROBLEMS 


1. Compute the work in foot-pounds done by the force of gravity when a 50-pound 
rock falls 200 feet off a vertical cliff. 

2. Compute the work in foot-pounds done against the force of gravity in raising a 
10-pound weight vertically 6 feet from the ground. 

3. Acar on a horizontal track is attached to a fixed point by a spring, as shown in 
Figure 24. The spring has been stretched 2 feet beyond its rest position, and the 
car is held there by a force of 10 pounds. If the car is released, how many foot- 
pounds of work are done by the restoring force of the spring in moving the car 
2 feet back to the rest position? 
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Figure 24 


4. An electron is attracted to a nucleus by a force which is inversely proportional 
to the square of the distance r between them; 1.e., - . If the nucleus is fixed, 
compute the work done by the attractive force in moving the electron from r = 2a 
tor = a. 

5. A container holding water is raised vertically a distance of 10 feet at the constant 
rate of 10 feet per minute. Simultaneously water is leaking from the container 
at the constant rate of 15 pounds per minute. If the empty container weighs 
1 pound and if it holds 15 pounds of water at the beginning of the motion, find 
the work done against the force of gravity. 

6. Suppose that a straight cylindrical hole is bored from the surface of the earth 
through the center and out the other side. An object of mass m inside the hole 
and at a distance r from the center of the earth is attracted to the center by a 


= mer 
gravitational force equal in absolute value to = , where g is constant and R 1s 


the radius of the earth. Compute the work done by this force of gravity in terms 
of m, g, and R as the object falls 


(a) from the surface to the center of the earth, 


(b) from the surface of the earth through the center to a point halfway between 
the center and surface on the other side, 


(c) all the way through the hole from surface to surface. 


[Hint: Let the x-axis be the axis of the cylinder, and the origin the center of the 
earth. Define the gravitational force F(x) acting on the object at x so that: 
(i) its absolute value agrees with the above prescription, and (ii) its sign agrees 
with the convention given at the beginning of this section.] 


7. Consider a cylinder and piston as shown in Figure 25. The inner chamber, which 
contains gas, has radius a and length 6. According to the simplest gas law, the 
expansive force of the gas on the piston is inversely proportional to the volume 

k : 
v of gas; i.e., F = — for some constant k. Compute the work done against this 
v 
force in compressing the gas to half its initial volume by pushing in the piston. 


a 


Figure 25 
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8. A rocket of mass m is on its way from the earth to the moon along a straight line 
joining their centers. Two gravitational forces act simultaneously on the rocket 
and in opposite directions. One is the gravitational pull toward earth, equal in 

Mim 


G ea 
absolute value to , where G is the universal gravitational constant, M, 


ry 
the mass of the earth, and r; the distance between the rocket and the center of the 
earth. The other is the analogous gravitational attraction toward the moon, 


Mom 


G 
equal in absolute value to , where M, is the mass of the moon and rz 


re 
is the distance between the rocket and the center of the moon. Denote the radii 
of the earth and moon by a and J, respectively, and let d be the distance between 
their centers. 


(a) Take the path of the rocket for the x-axis with the centers of earth and 
moon at 0 and d, respectively, and compute F(x), the resultant force acting 
on the rocket at x. 


(b) Set up the definite integral for the work done against the force F as the 
rocket moves from the surface of the earth to the surface of the moon. 


6. Integration of Discontinuous Functions. If a function f is continuous at 
every point of an interval [a, 6], then we know that f is integrable over 
[a, b] [see Theorem (5.1), page 199]. Continuity is certainly the most im- 
portant criterion for integrability that we have. For example, in the funda- 
mental theorem of calculus it is assumed that the integrand is continuous 
over the interval of integration. However, it is important to realize that a 
function does not have to be continuous to be integrable and that there are 
many simple discontinuous functions which can be integrated. 
We begin with the following theorem: 


(6.1) If f is bounded on [a, b] and is continuous at every point of [a, b] except 
Possibly at the endpoints, then f is integrable over [a, b]. 


Proof. If a = b, the conclusion follows at once since ri ‘ i, J = 0.. Hence we 
shall assume that a < b. To be specific, we shall furthermore assume that f 
is continuous at every point of [a, 5] except at a. The necessary modification 
in the argument if a discontinuity occurs at b (or at both a and 5) should be 
obvious. According to the definition of integrability (page 168), it is sufficient 
to prove that there exist partitions o and 7 of [a, b] such that U, — L,, the 
difference between the corresponding upper and lower sums, is arbitrarily 
small. For this purpose, we choose an arbitrary positive number e. Since f 
is bounded on [a, 5], there exists a positive number k such that | f (x)| te |S 
for every x in [a, b]. We next pick a point a’ which is in [a, b] and sufficiently 
close to a that 


ss eo 
O' <a nek 
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(see Figure 26). Since fis continuous on the smaller interval [a’, 5], we know 
that f is integrable over it. Hence there exist partitions o’ and 7’ of [a’, 5] 
such that the upper sum U,, and lower sum L,, for f satisfy 


é 
[Uo — Lr] < 3° (1) 
y-axis 
i y=f(o) 
a a’ oe. b X-axis 
k 


Figure 26 


Let o and 7 be the partitions of [a, b] obtained from o’ and 7’, respectively, 
by adjoining the point a; i.e., o = o’ U {a} and r = r’ U {a}. Since the 
maximum value of | f(x)| on the subinterval [a, a’] is less than or equal to k, 
it follows that 


em We — a) eg, =o: (2) 
By the same argument, we have 
iE. = Ee ea = aye SS (3) 


Next, consider the algebraic identity 
UG, -L, = eck ss x Oe) (L,’ _— L,) i (Uz: ee L,). 


The sum of three numbers is always less than or equal to the sum of their 
absolute values. Using this fact and the inequalities (1), (2), and (3), we obtain 


Uz — Ly < |Us — Un'| + |Le — Lol + [Uae = Les 
tS E E 
a et 


Thus there exist upper and lower sums lying arbitrarily close to each other, 
and the proof is complete. 
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EXAMPLE |. Let f be the function defined by 


eer ( ‘ 
sin —> it x Ged), 
f(x) = x 
0 its = 0. 


This function is continuous everywhere except at 0, and its values oscillate 
wildly as x approaches 0. The graph, for values of x in the interval [0, 2], is 
shown in Figure 27. Since | f(x)| < 1 for every x, the function is bounded on 
every interval. It therefore follows by Theorem (6.1) that f is integrable over 
[0, 2]. 


Figure 27 


An important extension of Theorem (6.1) is the following: 


(6.2) If f is bounded on [a, b] and is continuous at all but a finite number of 
points in the interval, then f is integrable over [a, b]. Furthermore, if ai, ..., Qn 
are the points of discontinuity and ifa < ay < +++ < a < 5b, then 


fe-f alee 


Proof. It is a direct corollary of (6.1) that f is integrable over each subinterval 
[a, ai], [ai, ao],...,[an, b]. By repeated applications of Theorem (4.2), 
page 191,we may then conclude that f is integrable over [a, b] and that 


b ay b 
tee pects | sf 


This completes the proof. 


SEC.6] INTEGRATION OF DISCONTINUOUS FUNCTIONS 455 


Consider the function f, whose graph is shown in Figure 28, and which is 
defined by 


0 —-e2<x< —-l, 

2 i 
oS a pe 3. 

—] a ee. 


y-axis 


Figure 28 


A function whose domain is the entire set of real numbers and whose 
range is only a finite set of real numbers is called a step function. Such a step 
function is continuous at all but possibly a finite number of points, and in the 
present example the discontinuities occur at —1, 2, and 3. It follows by 
Theorem (6.2) that fis integrable over any closed interval. In particular, 


| ged jas 


For each of the three integrals on the right side of the preceding equation, 
the integrand f is constant on the interval of integration except possibly at 
the endpoints. If we think of an integral as area or as an average value, we 
shall almost certainly support the conjecture that the value of an integral is 
not affected by isolated discontinuities in the integrand. Thus we expect that 


[r-[refhcefs 


ee ee er ee 
ge gs ee oS 8 
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This conjecture is correct (hence, so is the preceding computation), and is 
implied by the next theorem. 


(6.3) Let [a, b] be a subset of the domains of two functions f and g, and let 
F(x) = g(x) for all but a finite number of values of x in [a, b]. If fis integrable 
over [a, b], then so is g and if = t g. 


Proof. It is sufficient to prove this theorem under the assumption that the values of 
f and g differ at only a single point c in the interval [a, b] (because the result 
can then be iterated). To be specific, we shall assume that f(c) < g(c). The 
proof is completed if we can show that there exist upper and lower sums for g 
which differ from the integral }. f by an arbitrarily small amount. For this 
purpose, we choose an arbitrary positive number e. Since f is, by hypothesis, 
integrable over [a, 5], there exists a partition 7 of [a, b] such that the corre- 
sponding lower sum for f/, which we denote by L,(/), satisfies 


b 


However, every lower sum for f is also a lower sum for g. Hence we may 
substitute L,(g) for L,(/) in the preceding inequality and obtain 


b 


We next derive a similar inequality involving an upper sum for g. The 
integrability of f also implies the existence of a partition o’ of [a, b] such that 
the corresponding upper sum for / satisfies 


b 
€ 
U,'(f) a | ci < 3 3 
By possibly adjoining to o’ a point on either side of c, we can assure ourselves 
of getting a partition og = {xo,..., Xn} of [a, b] with the property that if c 
lies in the ‘th subinterval [x;_1, x,], then 
€ 


2g) — fo) 


We have already shown [see the proof of (1.1), page 168] that if one partition 
o is a refinement of another o’ (i.e., if o’ is a subset of oc), then U, < U,). 
Thus U,(f) is, if anything, a better approximation to Pe a than U,/(f). 
Hence 


(5) 


Xp Kt <S 


b 
uin- | £<é (6) 


Let M; and N; be the least upper bounds of the values of f and g, respectively, 
on [x;_1, x;]. Since f(x) = ey) except atc, it icllows that 


Ug) 2A) Sa = Me: = eee 


SEC.6] INTEGRATION OF DISCONTINUOUS FUNCTIONS 457 


But the difference VN; — M; can be no more than g(c) — f(c). Hence 
C= UA) = eo FOI = xi~2. 


and this inequality combined with (5) yields 
E 
U(g) — Ulf) <5 (7) 


Finally, adding the inequalities (6) and (7), we obtain 


b 
U,(g) oe | 7, < €. 


This is the analogue of (4) and completes the proof. 


EXAMPLE 2. Let f be the function defined by 


xe —-» <x<J, 
aS 42 x OG x & 2, 
2x — 5 aK a oe 


The graph of f is drawn in Figure 29. The function is clearly continuous 
except at 0 and at 2, and is bounded on any bounded interval. It follows by 
Theorem (6.2) that fis integrable over the interval [— 1, 3] and that 


pedis {co fs 


y-axis 


Figure 29 
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For every x in [—1, 0], we have f(x) = x’, and so 


0 
Ey =e dx = 5 —— 
oe 1 


For every x in [0, 2], we have f(x) = 2 — x? except that f(0) = 0. Hence, 
by Theorem (6.3), 


| f=] @=s3a6 (2x - 5) 


Similarly, f(x) = 2x — 5 for every x in [2, 3] except that f(2) = —2. Again, 
by Theorem (6.3), 


r-[o 


Hence 


2 


coli 


0 


= —6 — (-6) = 0. 


_ fa -kt+et0= 435 


EXAMPLE 3. Is each of the following integrals defined? 


(a) | sin ~ dx, 
Inx 
(b) ie oe 


= (@) 2 tan x dx. 


This is the same as asking whether or not each function is integrable over its 
proposed interval of integration. Strictly speaking, the answer is no in every 
case, because each function fails to be defined at one of the endpoints of the 
interval. However, Theorem (6.3) shows that this answer is based on a 
technicality and misses the real point of the question. If a function / is 
bounded on an open interval (a, 6) and if f(a) and f(b) are any real numbers 
whatever, then fis also bounded on the closed interval [a, b]. Let us suppose, 
therefore, that fis bounded and continuous on the open interval (a, 56). We 
may choose values f(a) and f(6) completely arbitrarily, and the resulting 
function will be integrable over [a, b] as a result of Theorem (6.1). Further- 
more, by Theorem (6.3), the integral ‘a f is independent of the choice of 
f(a) and f(b). Hence, if f is bounded and continuous on (a, 6), we shall 
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certainly adopt the point of view that fis integrable over [a, b] and, equiv- 
alently, that fo fis defined. 


l 
Following this convention, we see that the function sin — is bounded 
i 
bo ; : 
and continuous on (0, 1), and so | sin — dx is defined. Using L’H6pital’s 
x 


0 


Rule (page 123), one can easily show that 


prem Be 2y 
x—1+ I — 
l ice 
Hence, : = is bounded and continuous on (1, 2), and so | | = dx 
— x 1 


exists. On the other hand, 


im - tan x =o, 
xr— (1 | 2)— 


and we therefore conclude that tan x is not integrable over the interval 


PROBLEMS 


1. Determine whether or not each of the following functions is integrable over the 
proposed interval (see Example 3). Give a reason for your answer. 


ye 
(a) cos i over [0, 1] (c) a se ee > over [0, 2] 
¥ x — 1 
He 2 
(b) wtxas > over [0, 1] (d) as over [0, 2]. 


2. Is each of the following integrals defined? (See Example 3.) Give a reason for 
your answer. 


1 1/e 
sin x 1 
(a) | 3 dx (d) | ae dx 
1/2 D e 
(b) | eee (e) | nate: 
0 x 0 


1 
(c) [haw 
ae, 5 


3. Draw the graph of f, and evaluate f(x) dx in each of the following examples. 


1 if—«o <x< 0, 
(a) f(x) =45 10 = 5 2. 
2 if 2 eee 


and [a, b] = [—3, 3]. 


460 THE DEFINITE INTEGRAL (CONTINUED) [cuHap. 8 


if—o <x <0, 
x HO <x < @, 


and [a, b] = [—2, 2]. 


Onn ae 


(c) le) = Fi nee en a1 where v is any integer, 
and [a, b] = [0, 5]. 


4. Prove that if a function / is bounded on an open interval (a, b) and, if f(a) and 
f(b) are any two real numbers, then f is also bounded on the closed interval 
[a, bd). 


5. Compute 
1 t 
1 T 
(a) lim — dx (b) lim tan — x dx. 
io Jt *% Pie, ee Z 


How does the result give insight into the fact that neither integrand is integrable 
over the interval [0, 1]? 


6. A function f is said to be piecewise continuous on an interval [a, 5] if it is con- 
tinuous at all but possibly a finite number of points of the interval, and if, for 
every point c of discontinuity in the interval, there exist numbers & and / such that 

link fi =e and hae Jy = 4 
r— c+ rI— c— 


Using Theorems (6.2) and (6.3), prove that if f is piecewise continuous on 
[a, b], then it is integrable over [a, 5]. 


7. Improper Integrals. It is assumed in the definition of integrability 
(pages 168f) that if a function is integrable over an interval, then it 1s neces- 


I 
sarily bounded on that interval. Hence the function f defined by f(x) = = 


is not integrable over [0, 1] because it satisfies neither condition for bounded- 
ness: The number 0 is not in the domain of f, and there is no upper bound 
for f [for values of x near zero, f(x) becomes arbitrarily large]. The fact that 
f(O) is not defined is not a serious difficulty because, as was proved in Sec- 
tion 6, the values of a function at any finite set of points can be defined 
arbitrarily without affecting the integrability of the function. Thus we could 
set 


1 
f(x) se a ft xe 0, 
3 if x= 0; 


and thereby satisfy the first condition of boundedness. However, there is no 
way to make f bounded on [0, 1] by changing a finite number of its values. 

In this section we shall show that it is possible to extend the concept of 
integrability to include many functions which are not bounded on their 


SEC. 7] IMPROPER INTEGRALS 461 


intervals of integration. In addition, the extensions will allow the possibility 
of integrating over intervals which are not bounded. These integrals are 
called improper integrals. Two examples are 


,+ 0% unbounded integrand, 


ry e: ‘ax unbounded interval. 
0 


Let (a, b] be a half-open interval (containing b but not a), and let f be a 
function which is integrable over the closed interval [t, b] for every number 
tin (a, b]. The integral / is thus defined if a < t < 6b, and our definition 
will concern the limit 


im ff ee. 


t>a+ 


We consider the following three cases: 


(i) The function f is bounded on (a, b]. It is not difficult in this case to 
prove that fis integrable over the closed interval [a, b] and, in addition, that 
the limit (1) exists and is equal to i’ f. [if ais not in the domain of f, we define 
Ff (q@ arbitrarily. ] 

(11) The function f is not bounded on (a, b], but the limit (1) exists. In 
this case f is not integrable over [a, b] according to our original definition. 
Hence we define the improper integral, which is still denoted by { fi to De 
the limit (1). 

(iii) The limit (1) does not exist. In this case the integral is not defined. 


Thus, if the limit exists, we have the equation 


Sis lim f 
t—a+ 


If fis bounded on (a, 6], the integral is called proper. If fis not bounded on 
(a, b], the improper integral exists only if the limit exists. However, the 
traditional terminology, which we shall adopt, is that the improper integral 
is convergent if the limit exists and divergent if it does not. 

In spite of the improper integrals-defined in this section, we emphasize 
that whenever we say that fis integrable over [a, b] we mean it in the sense of 
the original definition of integrability, in which [a, b] is a bounded interval 
and fis bounded on it. 
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The situation is analogous if [a, b) is a half-open interval and / is in- 
tegrable over [a, t] for any ¢t in [a, b). We have 


If fis bounded on [a, b), then f is integrable over [a, b], and f : f is a proper 
integral. If fis not bounded, the integral is improper, and it is convergent 
or divergent according as the limit does or does not exist. 


EXAMPLE |. Classify each of the following integrals as proper or improper. 
If improper, determine whether convergent or divergent, and, if convergent, 
evaluate it. 


1 
(a) | a (c) ia rs age 
a4 ere 
(b) | 3 dx, (d) | sin ~ dx. 
Since 2. takes on arbitrarily large values near 0, we know that 


| : 
| —— dx is not a proper integral. For every ¢ in (Q, 1], 
0 Vx 
: | Ace = 23 = Dl a a/ 0: 
Since lim 2(1 — +/f) exists, we get 
t—o+ 


| eae lim 201 — Vt) = 2. 


t—0+ 


Hence (a) is a convergent improper integral with value 2. 
I . . 
The values of — also increase without bound as x approaches zero, and 
x 


(b) is therefore not a proper integral. For every ¢ in (0, 1], 


Hi fie=4 
Paes ace 
ee X/t t 
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However, 


uh 
lim 2 dx = lim Dept D, 
x? 


t—0+ t—0+ 


and, since the limit does not exist, the improper integral is divergent. 


The function = is not bounded on [0, 2), and so (c) is also an 


improper integral. For every ¢ such that 0 < t < 2, we have 


t 
l t 
f toa = - mp - af 


2 


Hence 


t 
l 2 
lim | — = i ie eae oor 


t—>2— 


2 
l 
and we conclude that | 5 
0 


dx is a divergent improper integral. 


| | 
Since | sin . < 1 for all nonzero x, the function f defined by f(x) = sin — 
x 


is bounded on (0, 1]. It is also continuous at every point of that interval. 
We now assign a value, say 0, to f(0), and it follows by Theorems (6.1) and 
(6.3) that f is integrable over [0, 1], and the value of { Feodxi is independent 


of the choice of f(0). As in Section 6, we therefore consider sin — dx to be 
0 XxX 


a proper integral. 


We next define improper integrals over unbounded intervals. Let a bea 
given real number and f a function which, for every ¢ > a, is integrable over 
[a, t]. If the limit rae f exists, we define it to be the value of the con- 


vergent improper sage & +, Thus 


If the limit does not exist, the integral of f over [a, «) also does not exist. 
Although it is not defined, we follow tradition and say that the improper 
integral is divergent. 
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As before, the analogous definition is given for the unbounded interval 
(—a%,a]. We have 


and the improper integral i f is convergent if the limit exists, and divergent 
if it does not. 


EXAMPLE 2. Test the following improper integrals for convergence or diver- 
gence, and evaluate the convergent ones. 


Gi | a, é | _s 
0 ae Se 
oe) t 

(b) | a (a). 
1 \/x ; —o + 


For (a) we have 


Hence 


| e"dx = lim(l—e )=1-—-0=1, 
0 t% 


and so the integral is convergent and equal to 1. 
Similarly, for (b), 


t 
| —|' ; 
—dx=2 = J4/h— 7; 
| ee vx, 4 


; si 
However, since lim (2\/t — 2) o, we conclude that | —=dk ie 


t0 1 <a 


divergent integral. 
For (c) we obtain 
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From this it follows that | ~, dx is convergent and equal to 3, since 
x 
2 


a4 it 1 
| dee im (5-3) = 3° 


If the integral in (d) exists, its value depends on ¢. Hence in testing for 
convergence, we use another variable. 


= arctan ?f — arctans. 
s 


t 
l 
| ea ax = arctan x 


Since lim arctan s = — —, we conclude that 


s>—@ 


ail 
2 


t 
: = + = a = — (arctan t — arctan s) 


arctan ¢ + 5 ) 


and the integral is convergent for all real values of t. 


We next enlarge the class of improper integrals to include integrands 
which are unbounded near both endpoints of the interval of integration. Let 
(a, c) be an open interval (we include the possibility thata = —ow,orc= 0, 
or both), and let f be a function which is integrable over every closed subinter- 
val [s, f] = (a, c). Choose an arbitrary point 6 in (a, c), and consider the two 


integrals ig f and 1. Jy) Uf either of these is eprapet their sum is equal to f. o 
and we need no new definition. If both f f and i, , J are improper integrals, 
then we define the improper integral iy 7 to Be their sum. Furthermore, 
f fis defined to be convergent if and only if both if fand a f are convergent; 


otherwise, iB _ J 1s defined to be divergent. Thus, in all cases, we have the equa- 
tion 


(2) 


For the definition to be a valid one, it is necessary to know that [ ‘ , as 
defined in (2), is independent of the choice of b. Hence, we need the following 
simple theorem, whose proof is left as an exercise. 
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(7.1) If f is integrable over every closed subinterval [s, t] of (a,c), and if bi 
and b, belong to (a, c), then 


ieee =e, 


EXAMPLE 3. Classify each of the following improper integrals as convergent 
or divergent. Evaluate, if convergent. 


w | 20 | 
(a) | 2 dx, (b) . T+ x BX. 


For (a) we write 
= a 
| Lax- | Lact | a, 
Pe eee Sa 0 x ne eee 


1 
1 
We have already shown in Example 1(b) that | — dx is divergent, and it 
ee 


a 
follows that | — dx is divergent. For (b) we have 
if eS 


zs 0 2 
| | | 
[hee | oteet | tee 


and 
0 0 
= ry a = a arctan x 
" = = (O — arctanft) = — (- *) = = 
Similarly, 


= dx.= lim arctan ¢t = = 
0 I 3 x? tn . 


ihe 


l 
Hence e =e dx is a convergent integral equal to 3 + - = oR, 


As a final extension of the class of improper integrals, we include the 
possibility that the integrand may be unbounded near a finite number of 
points in the interior of the interval of integration. Let (a, 6) be an open 
interval (including possibly a = —x, or b = x&, or both), let a,..., a 
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be points of (a, b) such that ay < +++ < a,, and let f be a function which is 
integrable over every closed bounded subinterval of (a, b) which contains 
none of the points a1,...,@,. Then the equation 


(3) 


is either a consequence of the theory so far developed, or is taken as the 
definition of the improper integral [. f. As before, : fis divergent if any one 
of the integrals on the right is divergent, and is otherwise either convergent or 


proper. 


EXAMPLE 4. Classify each of the following integrals, and evaluate any which 


are not divergent. 
1 
a 
(a) ote ax, 


1 
1 
(b) ie 


(c) | ca ee ae 


Since each integrand has arbitrarily large values near one or more points of 
the interval of integration, we conclude that none of the integrals is proper. 
For (a) we first observe that 


from which we obtain 


cage a 
ax == tim | me ay 
ax aS sae 


= 9 37 
ad lim ol Sadie: Miele 


t—>0— 


and, in the same way, 


“4 
| 
| = dx lim = dx 
gx” 1506 


= 2 — i“ a = #, 
t—>0+ 


iw) 
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From the definition in (3) it follows that 


1 0 if 
l l l 
| ae eee | xia 
=-3+9=0. 


Hence (a) is a convergent improper integral equal to 0. 
If (b) is convergent, it follows from the definition that 


yy 2 - 
i Lag = [ Lacy [ Lax, 
pen = ie 0 x 


and that both integrals on the right are convergent. However, 


1 1 
| I ae = iim ur 
0 x io Jt 


lim Un} —inpj= ~, 
t—+0+ 


1 


— dx is divergent. 
se 


0 
l 
and | — dx is similarly divergent. We conclude that | 
oe 8 ‘ 


ae I 
(Warning: Failure to note the discontinuity of the function — 
at O can result in the following incorrect computation: ee 


1 
| = = nia 
a 2! 


If the integral (c) is convergent, then it is given by 


1 


=~Q0—O=0.) 


3 ‘a 1 “ l 
| eane—y*- | ene—n*t | ee —9* 


and both integrals on the right are convergent. However, it is easy to show 
that neither is convergent. A partial-fractions decomposition yields 


I 
C=) = 3 


1 See 
2x-—1 2x-—3 


and so 


G= pea a = —4In|x sn l| + + In|x — 3| + Cc 


i 
> Mi 


ad + 
x— 1 
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In particular, therefore, 


1 1 t 1 
, ae) Roiee Se a)” 


‘= 4) — 4|n3 = a, 


2 


Lie) 


which is sufficient to establish that (c) is divergent. 


= (im 5 In 


i —— 


We conclude this section with a theorem which gives a convenient test 
for the convergence or divergence of improper integrals. Called the Com- 
parison Test for Integrals, it can frequently be used to classify an improper 
integral whose integrand has no simple antiderivative, such as fe are” dx. 


(7.2) COMPARISON TEST FOR INTEGRALS. Let f and g be integrable over 
every bounded closed subinterval of a not necessarily bounded interval (a, b). If 
f(x)| < g(x) for every x in (a, b) and if { : g is either convergent or proper, then 
[’ f is also either convergent or proper. 


Since an open interval can be split into two pieces, this theorem also 
holds for half-open intervals. For simplicity, we shall prove it for the interval 


(a, b]. 


Proof. We first prove that J |f| is either convergent or proper. We shall assume 
without proof the theorem which states that if a function / is integrable over an 
interval, then so is |f|. [In most applications of (7.2) the function f is con- 
tinuous at every point of (a, b]. In this case, || is also continuous and the 
problem does not arise.] Since 


0 = 7) < efx), (4) 


for every x in (a, b], it follows that 


b b 
| ris / g, 


i : : po: 
for every ¢ in (a, b]. Since g has nonnegative values, ee g increases as f ap- 
proaches a from the right. Hence 


b b b 
ss lim | | g, 
iat to>a+ Jt a 
bd b 
| tS / g, 


1b : 
for every ¢ in (a, 5]. But a | f| also increases as tf approaches a from the right, 
and the preceding inequality shows that it is bounded from above by the 


and therefore 
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b : ios 
number . g. An increasing bounded function must approach a limit. Hence 
; b : b prt 
lim J, |f| exists, and therefore J, || is either convergent or proper. 


ae —f (x) < |f(x)], it follows that 
0-< fi) + |f&)| = 22), (5) 


for every x in (a, b]. In this part of the proof, the inequalities (5) are the 
analogues of (4). In exactly the same way as in the preceding paragraph they 


imply that ‘ , 
| +l <2/ g 


b re 
and thence that the integral iB (f+ if) is either proper or convergent. 
Finally, therefore, we have 


im f f= Jim wf edo web wl ses Pane af eae 


Since both limits on the right exist, so does the one on the left. We conclude 
that _ fis either proper or convergent, and the proof is complete. 


EXAMPLE 5. Prove that . Cae is convergent. Since x? > x whenever 
x > 1, it follows that e~* < e~* for x > 1. An exponential is never 
negative, so e* = |e~*’|, and therefore 


ier es ot en for xX > F 


The convergence of ‘f e “dx, shown in Example 2, implies the convergence 
of ps e “dx. It follows by the comparison test, i.e., by Theorem (7.2), that 


Bo ue. . ‘ ‘ : é : 

: e**dx is a convergent integral. This, in turn, implies the convergence of 
ive) = pe 

fe eae 


PROBLEMS 


1. Classify each of the following integrals as proper, improper and convergent, or 
improper and divergent. Evaluate any which are convergent if an indefinite 
integral can be found. 


1 
(a) ie ——d ee iy 


(b) ee — (e) i tanx dx 


3 
1 sin x 
(c) | tos 2oDA dx af —— dx, 
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2. Classify each of the following integrals and evaluate any which are not divergent. 


(a) fda 
a. A 
2 

(b) [ da 
0 x 


(c) | (x —x+1)dx 
=| 


(d) | xe dx 
0 


(e) J sin xa 
a 
1 
(f) | e dx 
(g) Pe 
eh, te 2 


1 
(h) | ie ee. 
0 


1 
3. Show that the integral | = de iS 
0 


(a) properif—«o <s <0. 
(b) improper and convergent ifO0 < s < 1. 
(c) improper and divergent if1 <s < ~. 


. For what values of s is the integral | — convergent, and for what values is it 
1 #* 


divergent? Give reasons for your answers. 
. Classify each of the following integrals, and evaluate any which are not diver- 
gent if an indefinite integral can be found. 


(a) = (f) [ a 
: : 
(b) | a pe a Merce on 
: 
- I “a ad | aries 


(d) / <a © f A> ate 
J 0 
1 ] 
Be X 
©) Lew 2p of hes 


6. Prove Theorem (7.1). 
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7. Using the Comparison Test for Integrals if necessary, classify each of the 
following integrals. 


0 0 
aS ' 
(a) a (d) | =e ee 
Se Pe 8 y/(x — 1) — 2) 2 
200 1 
(b) | = sin x dx (e) | x sin : dx 
{f° 0 x 
ore) i 
(c) i e sin x dx (f) | a ax: 
0 0 V(x — 1)(x — 2) 


8. If F(t) = S_, e~dx, find F’(0) and F’(1). 
9. (a) Show that the area of the region P bounded by the x-axis, the line x = 1, 


— 
and the curve y = — is infinite. 
x 


(b) Show that the volume of the solid of revolution obtained by rotating the 
region P about the x-axis is finite. 


10. Prove that if f is bounded on (a, 5] and integrable over [t, b] for every ¢ in 
b b 
(a, b], then / is integrable over [a, b] and lim i ; = a f. (Hint: The argument 


‘(5a 


a 
is essentially the same as that in the proof of Theorem (6.1).] 


CHAPTER Y 


Infinite Series 


Addition of real numbers is basically a binary operation: Given any two 
real numbers a and 5b, there is defined a real number denoted by a + 6b and 
called their sum. The sum of numbers qaj,...,d,, where nm > 3, is then 
defined by repeated applications of the binary operation. For example, one 
way of grouping the terms 1s given by 


E ae (((ay =F a») == a3) = as) aig ees “f— o) ies 


The Associative Law of Addition implies that the sums obtained by all the 
different possible groupings are the same; so we can discard the parentheses 
and write 


Thus addition of any finite number of terms is defined. However, without 
further definitions, the sum of an infinite number of terms makes no sense 
at all. In this chapter we shall make the necessary definitions and develop 
the theory of infinite series. Two examples are 


= 1 
203i 


Se er 


ee ee ae 


a 


Later in the chapter we shall consider infinite series in which each term 
contains the power of an independent variable. An example is the series 


2 3 
a. x xt 
i ee 
which, for every real number x, is an infinite series of real numbers. We shall 


see that many functions can be defined by these power series, and this fact is 
of fundamental importance in mathematics and its applications. 
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1. Sequences and Their Limits. Infinite series are defined in terms of limits 
of infinite sequences, and make sense only in these terms. We therefore begin 
by reviewing the ideas of sequences which were introduced in Section 2 of 
Chapter 4. Following that, we shall develop some additional facts about the 
limits of infinite sequences. The definition of infinite series, i.e., of the sum of 
an infinite number of terms, will be given in Section 2. 

An infinite sequence is a function whose domain consists of all integers 
greater than or equal to some integer m. In the normal terminology of func- 
tions the value of a sequence s at an integer 7 in its domain would be denoted 
by s(i). However, it is customary with sequences to denote this value by s;. 
Thus 

So = Str for every integer F > mm. 


The sequence s itself is frequently denoted by {s;} or by an indicated enumera- 
tion of its values: 5, Sm+1,; Sm42,---- In the majority of examples m is 
either O or 1, and the first term of the sequence is then so or s;, respectively. 

An infinite sequence s of real numbers is said to converge to a real number 
L, or, alternatively, the number L is called the limit of the sequence s, written 

mes) ee! 
nH 
if the difference s, — L is arbitrarily small in absolute value for every suff- 
ciently large integer n. The formal definition is therefore: lim s, = L if, 
MD 
for every positive real number e, there exists an integer N such that 
|S, — L| < efor every integer n > N. 

Geometrically, a sequence s of real numbers is an indexed set of points 
on the real line. If the sequence converges to L, then the points s, of the se- 
quence cluster ever more closely about L as 7 increases (see Figure 1). That 
is, S, lies arbitrarily close to L if 7 is sufficiently large. 


S 
S| 52 54 fk ok 


Figure 1 


If the numbers s,, become arbitrarily large as m increases, then the se- 
quence does not converge and no limit exists. In the special case that, for 
every real number B, the values s,, are all greater than B for sufficiently large 
n, we shall write 

lim s,:= oe. 


nae 


The complete definition is: lim s, = » if, for every real number B, there 
nD 


exists an integer N such that s, > B for every integer n > N. A simple 
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example of a sequence which “converges to infinity” in this way is the sequence 
of positive integers 1, 2, 3, 4, 5,.... By reversing the single inequality 
s, > Bin the above definition, we obtain the analogous definition of 

lim 5, = — om: 

na 

It should not be supposed that if an infinite sequence s of real numbers 
fails to converge, then it follows that lim s, = +«. For example, the 
oscillating sequence = 
L414 —1, «>: 


is bounded and does not converge. Another example is the sequence 
O)y G2 z0HS)} O43). 05 

defined, for every integer n > 1, by 

= if n 1s even, . 


Sq = 


0 if n is odd. 


This sequence is not bounded and does not converge, since, as n increases, 
there exist arbitrarily large values s,. However, because of the regular 
recurrence of the value 0, it also does not satisfy lim s, = ~. 


ne 


The basic algebraic properties of limits of real-valued functions of a real 
variable, which are summarized in Theorem (4.1), page 32, also hold for 
infinite sequences of real numbers. We have 


(1.1) THEOREM. /f sequences {s,\ and {t,} converge and if c is a real number, 
then 


fy oon (s+ i.) = lms, 7 Bin ©. 


nD ns Nn 


ry lim (¢s,) = Hi 6). 


N->o0 tf Nees 2° 8) 
Gn) lim (s,f,) = (lim.s,)Cim 4, ). 
n—>0 nz tt > ao 
; lat &. 
(iv) lim + = *=*—->: provided lim t, # 0. 
TD Ln lim In ns 


nD 


We give the proof of (i). Let L; = lim s, and L2 = lim f,, and choose 


TLD i= 
an arbitrary number e. To prove (i) we use the fact that there exist integers 
N, and Ny such that 


| € € 
Is; —- Lil < 5 and jt; — Ls] < = 
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for every integer 7 > N, andj > No. If we set N equal to the larger of N, 
and No, then, for every integer n > N, we have 


€ 
lm iil<5 and |i, — Lal <5. 


Since (Sy + t,) — (Li + Le) = (8, — L1) + (ty — Le) and since |a + b| < 
|a| + |b| for any two numbers a and 5, it follows that 


Sn + tn) (Li + L»)| a Sr ses Ly) ea (tp = L»)| = Sie e255 Ly - a ree L,| 
€ € 
= 4 + ) =f, 


forn > N. This completes the proof of (i). The proofs of the other parts of 
the theorem are similar, and the methods are exactly the same as those used 
in Appendix A to prove (4.1), page 32. 

Similar to (1.1) is the following result, whose proof we omit. 


(1.2) Jf lim s, is the real number L and if lim t, = +x, then 


n> nD 


QQ lim =. 


nO Ly 


aa lin 22 = +2 re > ©, 


ot eee Ee 
Actually we have already used (1.1) and (1.2) in Chapter 4 in evaluating 


definite integrals as the limits of upper and lower sums. The following ex- 
ample is included primarily as a review. 


EXAMPLE |. Determine whether or not each of the following sequences con- 
verges, and evaluate the limit if it does. 


7 2n’ + 5n +2 

(a) {a,\ defined by a, = ee ee 

‘i ae 
(b) on defined by b; = G+ 123 . 

Sas we 

(c) . {c,) defined-by.c, = ae 

| ke + I 
(d) {d,} defined by d, = (—1)" ae 


Note that the definition of each of the above sequences is incomplete because 
we have neglected to specify the domain. However, the omission does not 
matter, since we are concerned only with the question of the limit of each 
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sequence. It follows immediately from the definition of convergence that the 
limit of an infinite sequence is unaffected by dropping or adding a finite 
number of terms at the beginning. 

For (a), after dividing numerator and denominator by n?, we get 


2n'+ 5n+2 _ n me, 


CS See 7 


Using (1.1) and (1.2), we conclude that 


| 
= 


fin 2, = 


Nn no 3 es ae 


So the sequence {a,} converges to 3. 
The ith term of the sequence {b,} can be written 


yg EG. stapictnes 2 | 
aes. ( ') 
— 


l 


b; 


l 
Since lim (: + -) = 1+ 0 = 1, we have 
I> l 


ee (: = 3 


Hence the sequence {b,) also converges to the limit 4. 
For large values of k, the number k + 1 is approximately equal to k, 
and the number k? + 1 is approximately equal to k?. Thus the behavior of 


l 
the ratio — , which ap- 


eae = 
k? + 1 kek 
proaches zero. We conclude that the sequence {c,} converges to zero. A 
more systematic analysis is obtained by writing 


, as k increases, is the same as that of 
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from which it follows by (1.1) and (1.2) that 
k+1 (1 + 0) 


Pons dnd loateamo 
The sequence obtained by taking the absolute value of each term in (d) 
ake k?> + 1 
is one which increases without bound. That is, it is clear that |d,| = i z i 
and that a 
hm = hn 


However, the factor (—1)* implies that the terms of the sequence {d;' 
alternate in sign, and for this sequence we can conclude only that no limit 
exists. 


A sequence s of real numbers is said to be an increasing sequence if 
Sina 2 Sis (1) 


for every integer / in the domain of s. If the inequality (1) is reversed so that 
Si44 < 5;, for every i in the domain of s, then we say that s is a decreasing 
sequence. A sequence is monotonic if it is either increasing or decreasing. 
Note that, just as in the analogous definitions for functions, we use ‘“‘in- 
creasing” and “‘decreasing”’ in the weak sense. That is, an increasing sequence 
is one which is strictly speaking nondecreasing, and a decreasing sequence is 
one which is literally nonincreasing. 

The following two theorems will form the basis of some fundamental 
conclusions about infinite series. Both are statements about increasing 
sequences, and corresponding to each there is an obvious analogous theorem 
about decreasing sequences. 


(1.3) Let s be an infinite sequence of real numbers. If s is increasing and if 
lim s, = L, then s, < L for every n in the domain of s. 


nD 


Proof. Suppose that the conclusion is false. Then there exists an integer N such 
that sy > L. Let a be the positive number sy — L. Since s is an increasing 
sequence, we know that s, > sy for allan > N. It follows that 


S, LA sy LS 


for all n > N. Since the difference s, — L is greater than or equal to the 
positive constant a, it cannot be arbitrarily small. Hence the sequence cannot 
approach L as a limit, contradicting the premise that lim s, = L. This 
completes the proof. = 
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A sequence s of real numbers is said to be bounded above by a real number 
Bifs, < Bfor every n inthe domain of s. If the inequality is reversed to read 
B < s,, we obtain the analogous definition of a sequence s which is bounded 
below by B. The second theorem is: 


(1.4) Let s be an infinite sequence of real numbers. If s is increasing and 
bounded above by B, then s converges and lim s, < B. 
no 

It is easy to see geometrically that (1.4) must be true. Because the se- 
quence is increasing, each point s, on the real line lies at least as far to the 
right as its predecessor s,_, (see Figure 2). In addition, we are given that no 
points lie to the right of B. Hence the points of the sequence must “‘pile up” 
or cluster at some point less than or equal to B. The proof which follows 
serves to make these intuitive ideas precise. 


S 
Sy Sy gd fe 


tpt tt att tt 


B 
Figure 2 


Proof. The range of s, which is the set of all numbers s,, has the number B as an 
upper bound. By the Least Upper Bound Property (see page 7), this set 
has a least upper bound, which we denote by L. Obviously, 


2. (2) 


We contend that lim s, = L. Since L is an upper bound, we have s, < L 
neo 


for every n in the domain of s. Since L is a /east upper bound, there must 
exist values s, of the sequence which are arbitrarily close to L. That is, for 
any « > 0, there exists an integer N such that |L — sy| = L — sy <.. 
Since the sequence is increasing, we have s, > sy for all n > N. Hence 
—S, < —sy and so 


Le — S| = L — Sa < L — by < «, 


for every integer n > N. This proves that lim s, = L, and this fact, together 


with the inequality (2), completes the proof. 


It should be remarked that the essential ideas of Theorems (1.3) and 
(1.4) are not limited to sequences. For example, by making only trivial 
changes in the proofs, we obtain the following analogous results about an 
arbitrary real-valued function f defined on an interval [a, ~): 


(1.3’) If f is increasing and if lim f(x) = L, then f(x) < L for every x in 


[a, x). r—0 
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(1.4’) If f is increasing and if f(x) < B for some number B and for every 
x in [a, «), then lim f(x) exists and, furthermore, lim f(x) < B. 


TH LH 


The latter asserts that every increasing bounded function must approach 
a limit, a result which we assumed without proof in the proof of the Com- 
parison Test for Integrals on page 469. 


PROBLEMS 


1. Determine whether or not each of the following sequences converges, and 
evaluate the limit if it does. 


4 < 1 
ke 1X? 
a= (k+ De (e) a= ( a 1) 
a oe : — os 
n? i 


2. Evaluate each of the following limits. 


(a Gre 3) 


(a) lim — eas (d) lim cos n 
1fi ae 
(b) lim e (e) lim _ cos n 
(c) lim (Wn G/k — se 


ka ze + 1 k— 00 


3. Determine whether or not each of the following sequences {s,} converges, and, 
if it does, evaluate the limit. 


(a) s, = (-1)’, ee See eens 


1 
1+ — for every integer n 
(b) s, = such that 1 < n < 10, 
= for every integer n > 10. 
1 get ae 
1+ 2 if n is a positive even 
I J 
ice a= integer, 
1, if n is a positive odd 


integer. 
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1 
1+ = for every integer 
(d) s = such that 1 < n < 10, 
va for every integer n > 10. 
1 Seg 
4. Let s be the sequence defined by s, = — , for every positive integer n. Draw an 
n 
x-axis, and plot on it the first ten points of the sequence. What is lim s,,? 
5. Let r be a real number, and consider the sequence 1, r, r*, r°, .... Show that the 
sequence converges if and only if —1 <r < 1, and that 
0 1 <r <4, 
lim r° = <1 ie oe ee © 
nee 00 ir > ft. 
What is the behavior of the sequence for r = —1 and forr < —1? 


ee tect = ec"? fore > 0.) 
6. Finish the proof of Theorem (1.1): 
(a) Prove part (ii). 
(b) Prove part (ii). 
(c) Prove part (iv). 
7. Let s and ¢ be two infinite sequences and a a real number such that 
Sn = a+ tn, 
for every integer greater than or equal to some integer k. Prove that 


lim s, = a + lim fp. 


n— n—O 


[Suggestion: It is easy to prove this result directly from the definition of con- 
vergence. Alternatively, one may consider a constant sequence with the single 
value a, and obtain the result as a corollary of Theorem (1.1), part (1).] 


8. Consider the sequence {s,,' defined by s, = n + (—1)", for every integer n > 0. 


(a) Write the first ten terms of the sequence. 


(b) Show that lim s, = «, but that {s,} is not an increasing sequence. 
nw 
(c) Give another example of a sequence {s,} which is not monotonic but for 
which him s, = 90s 
nO 


2. Infinite Series: Definition and Properties. We are now ready to define 
infinite series. Consider an infinite sequence of real numbers @,,, Qn4, 
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An4.,.... From this sequence {a;} we construct another sequence {5s,)}: 
with the same domain, called the sequence of partial sums and defined by 


Sm = Ams 


Smt+1 = Am + Am+1s 
Sm+t+2 = Gm + Am +1 Te Am+25 


That is, for every integer n > m, the number s, 1s given by 
= ee ee CT 


If the sequence {s,} of partial sums converges, we define its limit to be the 
value of the infinite series determined by the original sequence {a;}, and we 
write 


(2) 


EXAMPLE 1. Show that 
— | 
2 = eee ee 
7=0 


For this series the sequence of partial sums is given by 


ae \e 
Pe sp = + 5 
ag | : i re 


and, more generally, by 


l 
hg ge ee Peas 


Note that so = 2 — 1, 5; = 2 — 3, ands, = 2 — 4. Itis not hard to show 


I gine 
that s, = 2 — an for every positive integer 7. Hence 


lim s, = lim (2 — i) a2, 


and it then follows from the above definition that bg: a =a a 
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If, for a given sequence of real numbers d,,, Gn41,..., 1t happens that 
the corresponding sequence of partial sums does not converge, then the 
value of the infinite series is not defined. In this case we shall follow the 
customary terminology and say that the infinite series >°;_,, a; diverges. 
On the other hand, if the sequence of partial sums does converge, we shall 
say that the infinite series >> ;~,, a; converges. Summarizing the above defini- 
tions (1) and (2) in a single formula, we obtain the equation 


(3) 


in which the series on the left converges if and only if the limit on the right 
exists. 

Our first theorem states that if an infinite series >> ;_,, a; converges, 
then the sequence {a;} must converge to zero: : 


(2.1) THEOREM. Jf >_;_,, a; converges, then lim a, = 0. 
nD 
Proof. Let s = {s,\ be the sequence of partial sums. Since the infinite series con- 
verges, there exists a real number L such that 


ie.6) 

> 6 = tims, = 2. 

ae no 
Let s’ be the sequence defined by s}, = s,—1, for every integer n > m + I. 
The range of the function s’ is the same as that of s, and the order is the same. 
That is, enumeration of the terms of both sequences gives the same list of 


numbers: Sm, Sm+41,.... We conclude that 
lm x, = lis, 
Nn nO 


We next observe that, for every integer n > m + 1, 
/ 
yg Bie 8g oo Sa 


Since the limit of the sum or difference of two convergent sequences is the 
sum or difference of their limits [see Theorem (1.1), page 475], we have 


lima, = lims, — lims, =L—L= 0, 
nO ne nO 


and the proof is complete. 


As a result of Theorem (2.1) we see at once that both infinite series 
Da ee 
7=0 

Ee+) 


fest 


5 See ye ee 


cee ee ee eee 
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are divergent. For the first, lim a, = lim (—1)"2, which does not exist, and 


l 
for the second, lim a, = lim (2 + =) == 1D. 
no nwa nH 


[Warning: The converse of Theorem (2.1) is false. That is, it is not true 
that if lim a, = 0, then >°;_,, a; converges. A well-known counterexample 


nD 


is the series discussed in the following example. ] 


EXAMPLE 2. Show that the infinite series 
<1 
dugee bhehitah det ot 
k=1 
diverges. This series is called the harmonic series and is particularly interesting 


i ' pier 
because it diverges in spite of the fact that lim — = 0. To prove divergence, 


nD 


we first observe that s,, the nth partial sum of the series, is given by 


] ] 
Sj a ee ee 


Figure 3 


| 
Next, consider Figure 3, which shows the graph of the function — between 
x 


x = landx = 2-+ |. Withrespect to the partition = 71, Z,...,0 + 1,, 
the upper sum U, is equal to the sum of the areas of the shaded rectangles 
and is given by 


Ue=ltyte ti. 
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Thus U, = s,. Since every upper sum is greater than or equal to the cor- 
responding definite integral, we obtain 


ner 
. = u.> | —dx = In(n + 1). 
1 Xx 


We know that In(n + 1) increases without bound as » increases, hence the 


same is true of s,. Thus 
in <, = «, 


ne 


which completes the proof that the harmonic series diverges. 


The next theorem states that infinite series have what is commonly 
called the property of linearity. The result is a useful one because it shows 
that convergent infinite series may be added in the natural way and also 
multiplied by real numbers. Note that we have come across the property of 
linearity before. It is one of the basic features of finite series and also of def- 
inite integrals. 


(2.2) If Som a; and 7m by are convergent infinite series and if c is a real 
number, then the series >-;--m (a; + b;) and oj m ca; are also convergent, and 


O Y@to= Yat Ds 


=m =m 
[e.¢) [e.¢) 
(11) > Ca; = c) a;. 


Proof. The proofs of (i) and (ii) are direct corollaries of the corresponding parts of 
Theorem (1.1), page 475. Let {s,} and {7,} be the two convergent sequences 
of partial sums corresponding to > ag 00M ee _m Di, respectively. That is, 


n n 
in = ) : ais in = be, 
=m =m 


io. @) 


Pla; = tim, 3 b; = lim th. (4) 


zs n— oe n— 
=m 


By part (i) of Theorem (1.1), we have 
lim (s, + ¢,) = lim s, + lumedg, (5) 


nO m—> 00 no 


which shows, first of all, that {s, + f,\ is a convergent sequence. The lin- 
earity property of finite sums implies that 


n 


Snot = ys aad 3 bi = . (ai i), 


1m 
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from which we conclude that {s, + ¢,} is the sequence of partial sums 
corresponding to the series >; (a; + b;). Hence 


>, Gai + b:) = lim (sn + tr). (6) 


4 n> 
=m 


Substituting from equations (6) and (4) into equation (5), we obtain 


3 (a; + b;) = _ ar $ bi, 


and this completes the proof of part (1). Part (ii) is proved in the same way, 
and we omit the details. 


As an application of Theorem (2.2) we may conclude that if a series 
Dizm ai diverges and if c # 0, then >°;~,, ca; also diverges. For if the latter 
series converges, we know from part (11) of (2.1) that 


oO 


ie.¢) 
Yeas = Yo Peas = SO a 
=m 


=m 


oO |— 


and that the series on the right converges, contrary to assumption. For 


eae 
example, since the harmonic series > — diverges, it follows at once that the 
series i=1 


5" stautpt 


also diverges. 

It is an important corollary of the next theorem that the convergence or 
divergence of an infinite series is unaffected by the addition or deletion of 
any finite number of terms at the beginning. 


(2.3) If m <1, then the series > ja; converges if and only if ¥;~ 4; 
converges. If either converges, then 


Yam Dat Da 


=m =m 


Proof. Let {s,\ and {t,’ be the sequences of partial sums for >>;—,, a; and >> j_) a, 
respectively. Then 


n 
“ > Ai, for every integer n > m, 


= > Ai, for every integer n > /. 
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If n is any integer greater than or equal to /, then obviously 


n l—1 n 

a oat re, 

=m =m i=l 
Hence 


7—1 
Sn = 22 G4" Tas for every integer n > /. 
=m 


The number >-/_,,, a; does not depend on n, and is fixed throughout the proof. 
Thus, for n > 1, the sequences {s,,} and {f,} differ by a constant. It follows 
that one converges if and only if the other does and that 


nD nD 


rs 
ints, = > a; + lim fh, 
=m 


(see Problem 7, page 481). This completes the proof, since by definition, 


io) oe) 
lim Sy, = ai and lim tn = dj. 
im i=l 


nD n—- 


As an illustration, consider an infinite series >| ;_, a; whose first thousand 

| 
terms we know nothing about, but which has the property that a, = an for 
every integer » > 1000. We have shown in Example | that the series 


io @) 


a a converges, and it follows by Theorem (2.3) that Zz 5 also converges. 


1=0 7=—1001 


Since the latter series is precisely the series >> ;~199, @;, a Second application 
of (2.3) establishes the convergence of the original series }7;~, a. 
An infinite geometric series is one of the form 


in which a and r are arbitrary real numbers. For example, by taking a = | 


e e = l ° e 
and r = 4, we obtain the convergent series > th In studying the question 
i=0 


of the convergence or divergence of geometric series, it is sufficient to take 
a = | and consider the simpler series 


a es ee ay ae ee a4 


0 


io. ¢) 


7 
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For if this series converges, then so does >> ;_, ar’, and 
Se . co . 
bg ar’ = ay? me 
i=0 i=0 


On the other hand, if (7) diverges and a # 0, then >°;., ar’ also diverges. 
The principal result about the convergence of geometric series is the follow- 


ing: 


(2.4) The geometric series (7) converges if and only if -l<r<l. Jf it 
converges, then 


Proof. If r = 1, the series (7) is the divergent series 1 + 1+ 1+ ---. Hence, 
in what follows, we shall assume that r ~ 1. The sequence {s,} of partial 
sums is defined by 


Sn = p Se 1 oe ae ee eae 
1=0 
for every integer n > 0. Observe that 
ltrs, =1+rdatrt+-:::4+r”") 
= | jp pe peed = 


On the other hand, we have the equation 


Sn fe pi = eis 


It follows that 1 + rs, = sn, + r™*t!, whence 1 — r”t! = 5,(1 — r), and so 
— Be 
sn = wate 
1-—r 


The proof is completed by considering two cases. First of all, suppose that 
—l<r<l. “Then tim r= int r= v0 Ge Problem 5, page 481), 


nH wn nD 
and therefore 
Sat ahag sotcalben ahi 
tas ND . 1 =f 1 ee! (3 


Second, suppose that r < —1l orr> 1. For neither of these possibilities 
does lim r”*! exist (again, see Problem 5, page 481). It follows that lim s,, 


nD no 


also does not exist, and hence the series >°;_, r’ diverges. This completes 
the proof. 
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PROBLEMS 


1. Determine whether or not each of the following infinite series converges, and 
evaluate it if it does. 


oO 7 = * is ok 6i 
(a) Da x oP ae ee 
=~. a . 1 
) Dos © YG+) 
k=] k=0 
=.f/1 1 i — 1 
a+!) ® Lagi 
oa) 1 1 : ore k 2a i ' 
a a ria at cere 
j=0 k=0 
2. Consider the infinite series }>;~» a; defined by 
asi = 5 C= «1,2, : 
agi) = 0, aS 0, t. 23 BES E- 


Write out the sum of the first ten terms. Does the series converge? If so, to what 
value”? 

3. Using Theorem (2.2), show that if oo a; converges and if ee b; diverges, 
then >-;_,,. (a; + 6;) must diverge. 

4. Is it true that if the series >-;_,, (a; + 6, converges, then both >°;_,, a; and 
= 5 b; must also converge? Give a reason for your answer. 


5. Prove that the harmonic series 2, : diverges using the following elementary 
ji 
argument. Begin by grouping the terms of the series: 


a | 
Bop tse t+ Gs etet yD 
k=1 


= ae Meme y ee ok seein <b oc Gage © Seog 7 ee 


and observe that 


ee ae toe? 
ee ee ee oe 
ae _wf{l 1 if 
6. Consider the infinite series “2 ree area . By writing out a few terms of the 
k= 


sequence of partial sums, show that the series converges, and give its value. 

7. An infinite series of the form >¢;_,, (a; — a;11) 1s called a telescoping series 
(see Problem 6). Prove that it converges if and only if the sequence {a,' con- 
verges. If it does converge, what is its value? 
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8. Determine whether or not each of the following infinite series converges, and 
evaluate it if it does. 


= 2i+ 1 
©) Lae+aeD 


1=2 


- k+1 2 3 4 
(c) in( ) — in(7) + in(3) + in($) ae ce 
d. k 1 2 3 


(Hint: Look at Problems 6 and 7.) 


3. Nonnegative Series. The theory of convergence of infinite series is in 
many respects simpler for those series which do not contain both positive 
and negative terms. A series which contains no negative terms is called 
nonnegative. Thus >_;_,, a; is nonnegative if and only if a; => O for every 
integer i > m. In this section we shall study two convergence criteria for such 
series: The Integral Test and the Comparison Test. 

Let > 2, a; be an arbitrary infinite series (not necessarily nonnegative), 
and let {s,} be the corresponding sequence of partial sums. We recall that 
Sn = dtm 4;, for every integer n > m, and that, if {s,} converges, then 
Yim a; = lim s,. We shall extend the convention regarding the symbol x 


nD 
and write = 
2a ie aes (or —x) 
i= 
if and only if lim s, = « (or —<x). 


nn 
It may very well happen that a series neither converges nor satisfies 
Yim 4; = +x. For example, the divergent series 


yep =) ee 


7=0 


has for its sequence of partial sums the oscillating sequence 1, 0, 1,0, 1,.... 
However, for nonnegative series, there are only two alternatives: 


(3.1) Every nonnegative series > i;~, a; either converges or satisfies 
pagres te 


The proof of this fact follows directly from the following two lemmas: 


(3.2) If 2, a; is a nonnegative series, then the corresponding sequence {Sn} 
of partial sums is an increasing sequence. 
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Proof. For every integer n > m, we have Sn41 = Sn + Gn41. Since the series is 
nonnegative, it follows that 5,41 — Sn = Gn4i1 = 0. Hence 


ee Pe for every integer n > ™, 


which is the definition of an increasing sequence. 


(3.3) If {s,} is an increasing infinite sequence of real numbers, then either it is 
bounded above and therefore converges or else lim s, = ~. 


nD 


Proof. If the sequence is bounded above, then it is proved in Theorem (1.4), page 479, 
that it must converge. Suppose it is not so bounded. Then, for every real 
number B, there exists an integer N such that sy > B. Since the sequence is 
increasing, it follows that s, > sy foreveryn > N. Hences, > B, for every 
integer n > N, and this is precisely the definition of the expression 
lim s, = ©. 


ne 


ot 


We come now to the first of the tests for convergence of nonnegative 
infinite series. It is a generalization of the method used in Section 2 to prove 
the divergence of the harmonic series. 


(3.4) INTEGRAL TEST. Let f be a function which is nonnegative and decreasing 
on the interval [m, «). Then the infinite series )\;~ a; defined by 


a; = f(i), — for every integer i > m, 


is convergent if and only if the improper integral i; f(x) dx is convergent. 


Proof. The series Be _m 4 IS nonnegative, and its corresponding sequence {s,; of 
partial sums is therefore increasing. Figure 4 illustrates the graph of the 


m mt+i1m+2 n—1 n X-axis 


Figure 4 
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function f over an interval [m, n], where n is an arbitrary integer greater than 
m. Since f is decreasing, its maximum value on each subinterval of the parti- 
tion o = {m,m-+1,...,n} occurs at the left endpoint. Moreover, each 
subinterval has length 1. Hence the upper sum U,, which is equal to the sum 
of the areas of the rectangles lying above the graph in the figure, is given by 


—1 


- {WM = ae = Sy—1- 


Similarly, the lower sum L, is equal to 


ie f@ = Ss aj 


i=m+1 t=m+1 


As always, we have 


t.< | I(x) dx < Us, 


and it follows that 


Sn — Am = | Six) dx = Sn—1- (bp 


The crux of the proof of the Integral Test is in the inequalities (1). In com- 
pleting the argument, we consider the “‘if’’ and ‘“‘only if’? parts of the theorem 
separately. 

ee Let fz f(x) dx be a convergent improper integral. That is, 
lim = f(x) dx exists. Since f is nonnegative on [m, ©), the integral 


bow 


ie f(x) dx is an increasing function of b. Hence 


‘ | Far = | f(x) dx, for every integer n > m, 


[see (1.3’), page 479]. From (1) it follows that 


Sn ta? | {Q)ies @, = | Jaa, 


for every integer n > m. Hence the increasing sequence {s,\ is bounded 
above and therefore converges. The convergence of the sequence of partial 
sums is equivalent to the convergence of the corresponding infinite series, so 
we conclude that >-;_,, a; converges. 

“Only if.’ Suppose that >-;_,, a; converges. Then 


O 
Sp —1 = Sp < ; } ai, 
LM 


for every integer n > m [see (1.3), page 478]. For any real number 4 in 
[m, «), choose an integer n > b. Since { (x) dx is an increasing function 
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of b, we obtain from (1) 


b n 
| f(x) dx < | f(x) dx < Sn-1. 


Hence 


b 
| Tings =< De a;. 


The integral is therefore bounded above, and it follows that lim J : FCeydx 


b—oo 
exists [see (1.4’), page 480]. Hence es f(x) dx is a convergent improper 
integral, and the proof of the Integral Test is complete. 


The convergence or divergence of many infinite series can be determined 
easily by the Integral Test. Important among these are series of the form 


where p is a positive real number. Such a series is called a p-series. An 
example is the divergent harmonic series, for which p = 1. The basic con- 
vergence theorem 1s 


—. | 
(3.5) The p-series 3 ~p converges if and only if p > 1. 
i 


==] 


Proof. The function f defined by f(x) = = is nonnegative on the interval [1, «), 
x 


and is also decreasing on that interval since we have made the assumption 


that p > 0. Moreover, it is obvious that f(/) = = for every positive integer /. 


If p ~ 1, then i 


| ba =a +c. 
oe ce 


Hence we have the three computations: 


o<p<i:| De eg a ke , 
oer ed 1 


77 WD bso 


Pea ake Lax= | Lav timind = «, 
i [oe 


é b> 


rag ee 1 1 
oy pean a9 a 
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ee See . 
It follows that | = dx is convergent if and only if p > 1, and the Integral 
1x 


Test therefore completes the proof. 


Thus the first of the following three p-series diverges, and the last two 
converge: 


ag 
- 
I 
a 
hi 
oe 
i 
a 


l l 
: a SS oe ee, 
kare Wi? 3573 


Mes 
are 
| 


= 1 I ft 
2 a 


converges or diverges. 


. l 
EXAMPLE |. Determine whether the series i, eel 


k=1 


| 
The function f defined by f(x) = 1 is nonnegative and decreasing on 
xX 
| 
the interval [1, «), and obviously f(A) = ieee Since 


l 7 
poe dx = oo /2x +s, 


we have 


| ney fe = lim (arctan /2 b — arctan V2) 


= | — = arctan's/2 | 
V2 
Hence the integral is convergent, and therefore so is the series. 


We come now to the second of our convergence tests. 


(3.6) COMPARISON TEST. Jf >-;~,, a; is a nonnegative series and if ));~» 6; 
is a convergent series with a; < b; for every i > m, then Yoj~, a; converges 
and. > 4, @; =, Oi. 
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Proof. Let {s,' and {t,} be the sequences of partial sums for )>;_» a; and > \j=m di, 
respectively. The hypotheses 0 < a; < 5; imply that 


: ge Se re for every integer n > m, (25 


and also that both series are nonnegative. Hence both {s,} and {t,} are 
increasing sequences. The convergence of > i-m 6; Means that {r,) converges 
and that lim tf, = > \;2m b;. It follows that 


n—-2 


eS a b;, for every integer n > m 


=m 


[see (1.3), page 478]. Hence, by (2), 


(ee) 
ee x b;, for every integer 1 > m. 


Thus {s,\ is increasing and bounded above by 5 be sequence there- 


fore converges and 
ims, si pe; 


nO 


[see (1.4), page 479]. The convergence of s, implies the convergence of 
> im a; and that the value of the series is lim s,,. Hence 


ND 
ore 0 
>" a; = >. ee 
=m =a 


and the proof is complete. 


| 
27 — 7 
converges. We first observe that the first two terms of the series are negative. 


EXAMPLE 2. Use the Comparison Test to prove that the series Oy aa 


is non- 


However, 2i2 — 7 > O provided i > 2, and so the series Ys sul 


negative. It is sufficient to prove the latter series convergent ae of the 
important fact that the convergence of an infinite series is unaffected by any 
finite number of terms at the beginning. As our test series we take the con- 


. l e ° 
vergent p-series > —. To use the Comparison Test, we wish to show that 
1=1 = 


— ce <5 for every integer i > 2. (3) 


This inequality is equivalent to 2? < 27? — 7, which in turn is equivalent to 
i? > 7. The last is clearly true provided i > 3. Thus we have proved 
| ] 


eee 


os ; sy 
so for every integer i > 3, (4) 
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which is slightly weaker than (3). However, (4) is certainly sufficient. We 


wl 
know that the series > - converges. It follows from (4) by the Comparison 
ig & 


l 
Test that Sy 
Creal asd 


converges and, as a result, the original series 


co 


l 
23 25 does also. 


Example 2 illustrates a useful extension of the Comparison Test: The 
series >; a; converges if there exists a convergent series ~;~~. 5; such that 
0 < a; < b; eventually. The assertion that 0 < a; < 6; eventually means 
simply that there exists an integer V such that 0 < a; < 5b; for every integer 
i> N. The justification for this extension is Theorem (2.3), page 486. A 
similar observation should be made about the Integral Test. It may be neces- 
sary to drop a finite number of terms from the beginning of the series under 
consideration before a convenient function f can be found which satisfies the 
conditions of the test. 

The Comparison Test is as useful for proving divergence as convergence. 
It is an immediate corollary that if the nonnegative series > ;—,, a; diverges 
and if a; < 6; for every i > m, then >~;~,,b; also diverges. For if >) j~,, 6; 
converges, the Comparison Test implies that >—;_,, a; converges, which is 
contrary to assumption. 


l 
‘(k++ 5) 
diverges. If we use the Comparison Test, we ae decide whether to look 
for a convergent test series with larger terms to prove convergence, or a 
divergent test series with smaller terms to prove divergence. To decide which, 
observe that for large values of k, the number k? + 5 is not very different 


EXAMPLE 3. Determine whether the series 3 converges or 


| l 
from k?, and therefore (ee + 5) is approximately equal to jai * Stated 
more formally, we have 
iy Oe ee) ee 
ko ces = k— oo k? 
= - (1 -+- é) a= ] 


l 
This comparison, together -“ the pig of the p-series Ee jas? leads 
us to believe that the series i rai diverges. Hence a shall try a 
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re | 
divergent test series. The most obvious candidate, = p28 fails to be useful, 
since the necessary inequality, k=l 


l 
k23 > fo sya. 


is clearly false for every value of k. However, the series 2. 5218 is also 
— 
divergent, and we may ask whether or not it is true that 


a ere (5) 


This inequality is equivalent to 8k? > k? + 5, and hence to 7k? > 5, which 
is certainly true for every positive integer k. Hence (5) holds for every integer 
k > 1, and it therefore follows by the Comparison Test that the series 


of l : 
2 (4 5) diverges. 


PROBLEMS 
1. Test the following infinite series for convergence or divergence. 


(a) (f) sepia wea 


(b) ee (2) Se 


n=1 Vn - 2 

x 1 ie.) 1 
(c) = (h) GT 

ja a ke i=0 oe 

- 1 =~. 1 
= Gane Seer 

cae kp ia 

GO 1 [e.¢) 1 
©) Qian Oo Raa 


2. Using the Integral Test for infinite series and the Comparison Test for integrals 
(page 469), determine whether each of the following series converges or diverges. 


ive 


(a) 3 = (b> sin 5 


i—" eal 


3. Using the Integral Test, prove the theorem that, for positive r, the geometric 
series >-;_y r’ converges if and only if r < 1. 
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4. Let vj a; and >-;_,, b; be two convergent nonnegative series. Using the 
Comparison Test, prove that the series >>;_,, a:b; also converges. 
5. Prove the following theorem, which is hinted at in Example 3. If >-;_,, b; is a 


te me ; Se 
positive series (i.e., b; > 0 for every integer i > m) and if lim — = 1, then the 


n> n 


P oa) =f 5p oO 
series >. ;—~m a; converges if and only if >) ;—m b; does. 


4. Alternating Series. Special among infinite series which contain both 
positive and negative terms are those whose terms alternate in sign. More 
precisely, we define the series >0;_,, a; to be alternating if a;a;,, < O for 


every integer i > m. It follows from this definition that an alternating series 
is one which can be written in one of the two forms 


ee or pe hase 


=m =m 


where b; > O for every integer 7 > m. An example is the alternating har- 
monic series 


Oe 
| 


i°6) 2 l 
Fee 
I=1 
An alternating series converges under surprisingly weak conditions. 
The next theorem gives two simple hypotheses whose conjunction is sufficient 
to imply convergence. 


(4.1) THEOREM. The alternating series > ;—,, a; converges if: 


(i) |dn4i1| < |a,|, for every integer n > m, and 
(ii) lim a, = 0 (or, equivalently, lim |a,| = 0). 


nD nan 


Proof. We shall assume for convenience and with no loss of generality that m = 0 
and that a; = (—1)‘b; with 5; > O for every integer i > 0. The series is 
therefore eeee (—1)‘b;, and the hypotheses (i) and (11) become 


G) tesa for every integer n > 0, and 
t5 — hab, = 0. 
nw 


The proof is completed by showing the convergence of the sequence {s,,| of 
partial sums, which is defined recursively by the equations 


fo ca (—1)’bo = bo, 
Sy = Sacer H t= Hes 47 3 
The best proof that lim s, exists is obtained by an illustration. In Figure 5 


nw 


we first plot the point sy) = bo, and then the point s; = sy) — b;. Next we 
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b 
b, 
b; 
hema 
SS S 3 L S4 S2  So= by 


Figure 5 


plot so = s; + by and observe that, since b, < b;, we have sy < so. After 
that comes s3 = so — b3 and, since b3 < he, it follows that s; < s3. Con- 
tinuing in this way, we see that the odd-numbered points of the sequence 
fs») form an increasing subsequence: 

Ses Se a eS ES Cl) 


and the even-numbered points form a decreasing subsequence: 
oS St ae 


Furthermore, every odd-numbered partial sum is less than or equal to every 
even-numbered one. Thus the increasing sequence (1) is bounded above by 
any one of the numbers s2,, and it therefore converges [see (1.4), page 479]. 
That is, there exists a real number L such that 

lim Son—-1 = = 


ne 


For every integer n > 1, we have 
son = SIn—1 a3 bon, 


and, since it follows from (ii’) that lim 52, = 0, we conclude that 


nO 


N—> NO NO 
=L—-O=L. 


We have shown that both the odd-numbered subsequences {s2,—1; and the 
even-numbered subsequence {s2,} converge to the same limit L. This implies 
that lim s,, = L. For, given an arbitrary real number « > 0, we have proved 


tH 


that there exist integers N; and Np» such that 
lSon—1 — L| <, | whenever 2n — 1 > M,, 
Son — L| <e, | whenever 2n > No. 


Hence, if is any integer (odd or even) which is greater than both N; and No», 
then |s, — L| < e. Thus 


L = Jims, =->¢ (=1)'0:, 


ND pay: 


and the proof is complete. 


500 INFINITE SERIES [cuap. 9 


As an application of Theorem (4.1) consider the alternating harmonic 
series 


Hy s=1-4+4-44+-: 
t= 


The hypotheses of the theorem are obviously satisfied: 


(1) ce _ = = - for every integer n > 1, and 
ee e 1 e l 

sae ae 
(11) Bg 1) Le - 0. 


Hence it follows that the alternating harmonic series is convergent. It is 
interesting to compare this series with the ordinary harmonic series 


iy : =1+%3$+%4+4+-°:°::, which we have shown to be divergent. 
We see that the alternating harmonic series is a convergent infinite series 
Dizm 4: for which the corresponding series of absolute values >>22,, |a,| 
diverges. 

For practical purposes, the value of a convergent infinite series >" ;_,, a; 
is usually approximated by a partial sum )07_,,a;. The error in the ap- 
proximation, denoted by E,,, is the absolute value of the difference between 
the true value of the series and the approximating partial sum; L.e., 


io.) 


Sa; sab Sey 


=m =m 


Zz, a 


In general, it is a difficult problem to know how large n must be chosen to 
ensure that the error E, be less than a given size. However, for those alter- 
nating series which satisfy the hypotheses of Theorem (4.1), the problem is an 
easy one. 


(4.2) If the alternating series > \;~,, a; satisfies hypotheses (i) and (ii) of 
Theorem (4.1), then the error E,, is less than or equal to the absolute value of the 
first omitted term. That is, 

Ey < |Qn4i|, for every integer n > m. 
Proof. We shall use the same notation as in the proof of (4.1). Thus we assume that 


m = 0 and that a; = (—1)‘b;, where 5; > O for every integer i > 0. The 
value of the series is the number L, and the error E,, is therefore given by 


oO n 
; a ‘Sikes ai 
7=0 7=0 


p= = | — s5,|. 
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Since |dn41| = 5x41, the proof is completed by showing that 
JL — Sn| < bn41, for every integer n > 0. 


Geometrically, |L — s,| is the distance between the points L and s,, and it can 
be seen immediately from Figure 5 that the preceding inequality is true. To 
arrive at the conclusion formally, we recall that {s2,_1} is an increasing se- 
quence converging to L, and that {so,} is a decreasing sequence converging 
to L. Thus if n is odd, then n + 1 is even and 


Sn SL S Sn41. 
On the other hand, if 7 is even, then n + 1 is odd and 
i412 SS Sa. 


In either case, we have |L — sn| < |sni1 — S|. Hence, for every integer 
mn 2 0, 


E, = |L — Sa| < [Seti — Sal = l@nqal, - 
and the proof is complete. 
In Table 1 we have computed some partial sums which approximate the 
value of the alternating harmonic series. Each entry in the second column is 


an approximation, and the corresponding entry in the third column is the 
upper bound on the error provided by Theorem (4.2). 


TABLE 1 


Alternating harmonic series: >» (— i ; : 


ti=1 


Partial sums: s, = 1 — $+4—**s + (—1y*=, 
Ne ] upper bound 
n Sn = approximation Gyan = cr te es 
I l 4 
§ i 
4 13 5 
10 0.646 0.091 
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a 


PROBLEMS 


. Determine whether each of the following alternating series converges or diverges. 


Give the reasons for your answers. 


rye ee eae 
n=2 


i=1 € 
oo = ; 1 
a B a =| du Se 
(c) pa oy (g) > cos(irr) 
ke +1 fons 
cos(k7r) 
we Pores at ie 
. Prove that, for any infinite sequence {a,\ of real numbers, lim a, = 0 if and 


only if lim |a,| = 0. (Hint: The proof is simple and straightforward. Go directly 


n—- OO 


to the definition of convergence of an infinite sequence.) 


. For each of the series >-;_,, a; in Problem 1, determine whether or not the 


% . ie.¢) 
corresponding series of absolute values > >;_», |a;| converges. 


. ° . (oe) as 
. Give an example of an alternating series }°;_,, a; which you can show con- 


verges, but which fails to satisfy condition (i) of the Convergence Test (4.1). 


. The first of the following examples comes from the formula for a geometric 


series, and the last two follow from the theory developed later in this chapter: 


oe : 1 
(b) n2= Do- 1 5G = t- 


(c) w= 4arctan1 = LD apy = 4 = $e o 
i=0 


If the value of each of these series is approximated by a partial sum : ee 
how large must n be taken to ensure an error no greater than 0.1, 0.01, 0.001, 
5 a 


Absolute and Conditional Convergence. An infinite series >, a; is 


said to be absolutely convergent if the corresponding series of absolute values 
Dvi=m |@;| is convergent. If a series #,, a; converges, but >%,, |a,| does 
not, then we say that >> ;_,, a; is conditionally convergent. An example of a 
conditionally convergent series is the alternating harmonic series: We have 
shown that 


eS 
SS 


i Ms 
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converges, but that 


i°.6) oO ] 


ee ee 


diverges. 

There are many examples of series for which both >°;~,, a; and >°;_,, |a;| 
converge, and also many where both diverge. (In particular, for nonnegative 
series, the two are the same.) There is the remaining possibility that >> ;_,,, |a,| 
might converge, and >" ;_,, a; diverge. However, the following theorem shows 
that this cannot happen. 


(5.1) THEOREM. [If the infinite series > ;—,, a; is absolutely convergent, then it 
is convergent. 


Proof. Since |a;| > —a;, we have a; + |a;| > 0, for every integer i > m. Hence the 
. (°6) ° . ° 
series )-;~m (ai + |a;|) is nonnegative. Since a; < |a;|, we also have 


a; + lal < ail + lai] = 2Iail, (1) 


for every integer i > m. The assumption that Soi -m 4 iS absolutely conver- 
gent means that the series >-;_,, |a;| converges, and, hence, so does the series 
> i-m 2\a;|. It therefore follows from (1) by the Comparison Test that the 
nonnegative series >-;-m (a: + |a;|) is convergent. We conclude from 


Theorem (2.2), page 485, that 


>, % = 3 (ai + lal) — > Iai 


=m =m 


and that >-;_.. a; converges. This completes the proof. 


Thus the only possibilities for a given series are those illustrated by the 
following scheme: 


absolutely convergent 


conditionally convergent 


see, Ss nag 


divergent 


EXAMPLE |. Classify each of the following infinite series as absolutely con- 
vergent, conditionally convergent, or divergent. 


: at) Se eee b : ee ow | 
(a) 2 ( ae) (b) 2 ¢ ) Fe 15 
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If we let a, = (—1)* , the alternating series in (a) will converge 


] 
VE 
if: 

(i) |@z4i| < |a,|, for every integer k > 1, and 
(ui) Jim ja,| =-0. 
k 


> 


[See Theorem (4.1), page 498.] We have 


1 
Wk =o 


l 
Vk +1 
Hence condition (i) becomes 
Vk+2°> Vk+1 


which is certainly true. Condition (ii) is also satisfied, since 


lax| = and aka] = 


for every integer k > 1, 


and it follows that the series }°;,—, a, converges. However, it is easy to show 
that >°;~, |a,| diverges by either the Comparison Test or the Integral Test. 


Using the latter, we consider the function f defined by f(x) = | ; 
V 
which is nonnegative and decreasing on the interval [1, 0). We have 
| 

Sk) = Bey A eae | = |az,| and 

. aoe ) aaa 

feddx = | — dx = lim Vx +1 |] 
1 es eae bx 
= lim 2/b + 1 — 2/2] = ao. 
b> 0 


The divergence of the integral implies the divergence of the corresponding 
series >°;—1 |a;|, and we conclude that the series (a) is conditionally con- 
vergent. 

For the series in (b), we might apply the same technique: Test first for 
convergence and then for absolute convergence. However, if we suspect 
that the series is absolutely convergent, we may save a step by first testing 
for absolute convergence. In this particular case, the corresponding series of 


_< l 
absolute values is 2 yok? — 15) 


G - The latter can be shown to be convergent 
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by the Comparison Test. For a test series we choose the convergent series 


—. 2 
6% ia The condition of the test is that the inequality 
k=1 
l 2 

a ee aie 

\2k2 — 15| — k? 
must be true eventually. We shall consider only integers k > 3, since, for 
these values, 2k? > 18 and hence |2k2 — 15| = 2k? — 15. For those 


integers for which k > 3, the inequality 


l 2 
We — 15 =k 
is equivalent to k? < 4k? — 30, which in turn is equivalent to k? > 10. The 
last is true for every integer k > 4. Hence 
l 2 


< 


ag ae —— 3 : A 
esl for every integer k > 4 


- l 
It foll that eee 
ollows tha 2 be — 15 


is absolutely convergent. 


converges, and therefore that the series (b) 


6 fini Pasty oe 
(5.2) RATIO TEST. Let ));=,, a; be an infinite series for which lim ere 


(or on). rs) In| 
(i) Ifq <1, then the series is absolutely convergent. 
(ii) Ifg > 1 (including q = x), then the series is divergent. 


(iii) If gq = 1, then the series may either converge or diverge; 1.¢., the 
test fails. 


Jan +1 = 


Proof. Suppose, first of all, that lim gq <1. This implies that the ratio 


nn a,,| 


Jantal is arbitrarily close to g if n is sufficiently large. Hence if we pick an 


la. 
arbitrary number r such that g <r < 1, then there exists an integer VN > m 
such that 
nak 
| i Sul for every integer n > N. 2) 
An 


We shall show by mathematical induction that (2) implies that 
lanai] < r'law|, _ for every integer i > 0. (3) 


If i = 0, then the inequality in (3) becomes |ay40| < r°|ax|, which is true. In 
the second part of an inductive proof we need to show that, if the inequality 
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(3) is true for 7 = k, then it is also true fori = k + 1. The assumption, then, 
is that 


lave] < r’layl, (4) 
and we want to prove that 
laveesal < "lanl. 
If we multiply both sides of inequality (4) by the positive number r, we get 
rlay4i| < lay. Gy 


But, inequality (2) tells us that 


rseaes 
Ear oe 4 
and hence that 
langeaa| < rlay +e. (6) 


Combining inequalities (5) and (6) we have 
lan 44-41] tee gage 


completing the inductive proof. Since |r| <1, the geometric series 
> io |av|r’ converges, and it follows from (3) by the Comparison Test that 
the series }°;~) |ay .,;| converges. However, 


00 00 
2 lay +.;| = = lasl, 
4=0 tN: 


and the convergence of >-;_y |a;| implies the convergence of >>;_,, |a;|. 
Hence the series >-;_,,, a; converges absolutely, and the proof of part (i) of 
the theorem is complete. 


An 1 
la 41 = gq >-1, and let _r be an arbitrary 


We next assume that lim 
nn la,,| 


number such that 1 <r <q. Then there exists an integer N > m such that 


\an+1| 


ian =, for every integer n > N. 
Ly 


In the same way in which we proved that (2) implies (3), it follows by induc- 
tion from the preceding inequality that 


lav+i| > r'lay|, for every integer i > 0. 
Since r > 1, we know that lim r‘ = © (see Problem 5, page 481), and there- 
fore also that 
lim la,| = lim jay.,) = ©. 
NO 1— 0 


However, if the series >\;_,, a; converges, then it necessarily follows that 
lim |a,| = lim a, = 0. [See Theorem (2.1), page 483, and Problem 2, 


no 


na 
page 502.] Hence >°;_,, a; diverges, and part (ii) is proved. 
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Part (iii) is proved by giving an example of an absolutely convergent 
series and one of a divergent series such that g = 1 for each of them. Con- 


(o) 
; 1 
sider the convergent p-series y —, which, being nonnegative, is also 
2 


i) 


1 
absolutely convergent. Setting a, = — , we obtain 
it 
SS eee 
= (n+1)2 nm+2n+1 
and 
Tantil _ anti _ [Se SRE TOES 
ro fee 2n os 2 1 
an ee 
nN fiz 
Hence 
n 1 
li oats a im 5 = 1 - 
nD n ae Sa + = - — 
n Mn 


; Sel 
For the second example, we take the divergent harmonic series :3 —. If we 


1 1 i=! 
let a, = -—, then a,41 = and 
n n+ 1 
2 ae, ne ee 
lan| a. n+ 1 cs 1 
nN 
For this series we also get 
° | iL) | ° 1 
= = 7 = A. 
N— OO An n— 
2 7 ase 


iH 


The Ratio Test is therefore inconclusive ifg = 1, and this completes the proof. 


If n is an arbitrary positive integer, the product n(m — 1)--:3-2- 11s 
called n factorial and is denoted by nm! Thus 3! = 3:2:1= 6 and 
5! = 5-4-3-2:-1 = 120. Although it may seem strange, 0! is also defined 
and has the value 1. A convenient recursive definition of the factorial is 
given by the formulas 


is 
(n + 1)! 


n+ 1)nl, for every integer 7 > 0. 
y g 


EXAMPLE 2. Prove that the following series converges: 


n! 


=— | l | 
ee ae 
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o I 
We write the series as >> ,;=» @, by defining a, = a for every integer n > 0. 
Then n 


l 
Gna) Ol tae ow 
|an| l (n+ 1)! 
n! 
n! 


Hence 


q = lim Manta) _ lim ! == 0 


ame — eee noon + | 


Since g < 1, it follows from the Ratio Test that the series is absolutely 
convergent. But absolute convergence implies convergence [Theorem (5.1)], 


ie.¢) 


and we conclude that the series ‘2 — converges. 
n} 
0 


EXAMPLE 3. Show that the infinite series 


Soir = 14 Or $3 + 4 


2. 
I 
= 


converges absolutely if |r| < 1 and diverges if |r| > 1. This series is related 


to the geometric series D2) 7’ = 1 +r+72+--:, and ina later section 
we shall make use of the relationship. To settle the immediate question of 
convergence, however, we set a; = ir’ ' for every positive integer 7, and write 


the series as die , a;. Observe, fab of all, that if i7| >t tien le = ir” 
and 

lim |a,| = lim tg ai 

nD 
Hence, if |r| > 1, the series must diverge, since convergence would imply 
lim |a,| = 0. This proves the second part of what is asked, and we now 


nD 


assume that |r| < 1. If = 0, the series is absolutely convergent with value I, 
so we further assume that r ~ 0. Then 


larga] _ (+ Dlr -"t1i = (14 ‘Y ih 


ld| ag aa 


and so 


lim Hott = jim ( 2 ‘) in| = |r. 


ND | An N—>D 
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Thus gq = |r| < 1, and the Ratio Test therefore implies that the series is 
absolutely convergent. 


The next theorem, with which we conclude the section, establishes a 


useful inequality. 


(5.3) If the series >- 7, a; converges, then |>> 7m ai) < Dvem ail. 


The result is true even if >-;_,, a; is not absolutely convergent, for in 


that case > ;_,, |a;| = oo, and the inequality becomes |>°;_,, a;| < 0. 


Proof. In view of the preceding remark, we shall assume throughout the proof that 


e.6) . : 
> i-m {a:| converges. Let {s,} be the sequence of partial sums corresponding 
to the series }\;~» a. Then 


n 
se > Ai, for every integer n > m, 
=m 


and the assumption that >-;_,,, a; converges means that the sequence {s,} 
converges and that 


ee) 


iis, = > ai. ee. 


ne : 
Im 


The general fact that |a + b| < |a| + |d|, for any two real numbers a and 5, 
can be extended to any finite number of summands, and we therefore have 


n n 
> a = 2, la;|. 
i= 


=m 


[Sn] = 


Furthermore, 
3 ia = > la;| 
[see (3.2), page 490, and (1.3), page 478). Hence 


oO 
set Se 2, la;|, for every integer n > m. (8 ) 


It follows from (8) that 


ie) 


‘ins, | = 2 ja ;\. (9 ) 


na é 
=m 


[It is easy to see that (8) implies (9) if we regard the numbers s, and ae la;| 
as points on the line. The geometric statement of (8) is that all the points s,, 
lie in the closed interval whose endpoints are —>>;—» |a;| and )\;_» |a;|. If 
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(9) were false, it would mean that lim s, lay outside this interval, a positive 
n—-0 


distance away from it. But this cannot happen, since s, is arbitrarily close to 
lim s, for n sufficiently large.] Combining (7) and (9), we obtain the in- 


n—- oO 


equality which was to be proved. 


PROBLEMS 


1. Classify each of the following infinite series as absolutely convergent, condition- 
ally convergent, or divergent. Show how you obtain your answer starting from a 
standard test or series. 


oe) 2 1 od) : 5” 
(a) 2800 Ee ) Pah a 
= 1 =. 100° 
(b) LUE ha es (f) is 
= k 1 k k! 
© DCU Egy (g) ze D: gar 
@ ok ee 
ca: 4=] 


ie) 
Se 
2. (a) Prove that the series > = is absolutely convergent. 
n=0 


— 
(b) Prove that lim — = 0. 
nO é 
3. Classify each of the following series as absolutely convergent, conditionally 
convergent, or divergent. Show how you obtain your answer. 


A==l1 
00 : 1 
(b) >> (-1) nn (d) in 8 
n=1 cs 


ee) 
= es 
4. (a) Prove that, for every positive number a, the series > ae absolutely 
convergent. jos Jel 


ae: hy it3 
(b) Prove that lim — = O for every positive number a. 
nwo 
5. The infinite series 


: = 1 1 1 1 
= =a a ae Be Be aaa ae es paisa eteaisanas eee 
24a = 145+ 573+ 55 + gage + 333 + gags t 
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is defined, for every integer n > 0, by the two equations: 


1 
2n3n 


1 
Anti = ant130 : 


42, = 


(a) Show that >>) an is absolutely convergent. 


(b) What is lim Jansal 9 


ee 


6. As a corollary of (5.1), prove the following extension of the Comparison Test: 
The series >; ~m a; is absolutely convergent if there exists an absolutely convergent 
. 00 
series >\;-, b; such that |a;| < |b;| eventually. 


6. Power Series. Associated with every infinite sequence of real numbers 
do, 41, 2, ... and every real number x, there is the infinite series 


dy + ax + Ox + -°°. 


Such a series is called a power series in x. As a general rule, it will converge 
for some values of x, but not all. For example, the geometric series 


ltxtxert+xwsteoe 


converges and has the same value as for every real number x in 


See 

absolute value less than 1, but it diverges for all other values of x. A power 
series which converges for some real number c, i.e., which converges if 
x = c, is commonly said to converge at c. Note that every power series in x 
converges at 0, since, if x = 0, then 


Dae = my shia ‘ss = Qo. 
1=0 


The following proposition is the basic theorem in studying the con- 
vergence of power series: 


(6.1) If a power series Yj.) a;x' converges for some real number c, then it is 
absolutely convergent for every real number x such that |x| < |c|. 
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Proof. If c = 0, the result is vacuously true, so we shall assume that c # 0. The 
fact that the series >-;_, a,c’ converges implies that lim a,c” = 0. Hence 


n— 


there exists a nonnegative integer N such that |a,c”| < 1, for every integer 


n> N. Since |anc”| = |a,| |c|", it follows that 
1 
Ceca 
|an| a le|* 
and thence that 
Ix|” 
n n 
I = < ? 
janx"| = lanl eI" <7 


for every real number x and for every integer n > N. We now impose the 


Ix 


restriction that |x| < |c|, and set r = ——-. Thenr < 1, and 


Ic| 
|anx"| <r”, for every integer n > N. 


That is, we have shown that |a,x”| <r” eventually. Since the geometric 
series >\;_) r’ converges if |r| < 1, it follows by the Comparison Test that 
>i |aix'| converges. This completes the proof. 


We shall derive three corollaries of (6.1). The first asserts that the set of 
all real numbers x at which a power series >~j--) a;x' converges is a nonempty 
interval on the real line. The set is nonempty because, as is remarked above, it 
contains the number 0. A set of real numbers is an interval if, whenever it 
contains two numbers, it contains every number in between those two. Thus 
we must prove that if the series converges at aand atc andifa < b < c, then 
it also converges at b. This is quickly done. Suppose first that b > 0. Then 


jb) SO = Jel, 


and (6.1) implies that the series converges at b. On the other hand, if b < 0, 
then : 
jb] = —b < —a = |al, 


and it again follows from (6.1) that the series converges at b. This completes 
the proof, and, as a result, we call the set of all numbers at which a power 
series converges the interval of convergence of the power series. 

A number a is called an interior point of a set S of real numbers if there 
exists an open interval which contains a and which is a subset of S. The set 
of all interior points of S is called the interior of S. For example, if S is itself 
an open interval, then all its points are interior points and hence S equals its 
own interior. More generally, the interior of an arbitrary interval consists 
of the interval with its endpoints deleted. 

The second corollary states that a power series converges absolutely at 
every interior point of its interval of convergence. The proof is virtually the 
same as that of the first corollary. Let 6 be an arbitrary interior point of the 
interval of converence. Because it is an interior point, we know there exist 
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real numbers a and c which also lie in the interval and for whicha < b < c¢. 
As before, if b > 0, then |b] < |c|, but if 5 < 0, then |b] < jaj. In either 
case it follows from (6.1) that the series converges absolutely at 5, and this 
completes the argument. 

The third corollary is the following: The interior of the interval I of 
convergence of a power series >~;~) a;x' is symmetric about the origin. That 
is, if b is an interior point of J, then so is —b. Again, there exist numbers a 
and cin J such that a < 6 < c. Nowconsider the open interval (—c, —a). 
It certainly contains —b, and, if we can show that (—c, —a) is a subset of J, 
then we shall have proved that —d is an interior point of J. Let x be an 
arbitrary number in (—c, —a), that is, —c < x < —a. There are the, by 
now familiar, two possibilities: If x > 0, then —a > 0 and 


ja) ieee = a). 
If x < 0, then —c < 0, whence c > O, and 
f= ee C= el 


For either possibility, the convergence of the series at x is implied by (6.1), 
and so the third corollary is proved. 

If a power series converges for every real number, then its interval of 
convergence is the set of all real numbers. The only other possibility, 
according to the third corollary above, is that the interval of convergence is 
bounded with symmetrically located endpoints —p and p. We define the 
radius of convergence of a power series >_;~) a;x' to be infinite if the interval 
of convergence is the set (— », ~) of all real numbers, and to be the right 
endpoint of the interval of convergence if the interval is bounded. The 
preceding results are then summarized in Figure 6 below and in the following 
theorem: 


radius of 


interval of convergence convergence 


divergence < divergence 


x-axis 


= 0 0 


Figure 6 


(6.2) THEOREM. If a power series > j~) a;x' has radius of convergence p, then 
the series converges absolutely at every x in the open interval (—p, p). If p is 
not infinite, then —p and p are the endpoints of the interval of convergence. 


It is important to realize that, when p is finite, we have made no predic- 
tion as to whether the series converges or diverges at p and at —p. All we 
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know is that it converges absolutely in the open interval (—p, p) and diverges 
outside the closed interval [—p, p]. 


EXAMPLE |. Find the interval and radius of convergence of each of the follow- 
ing power series: 


ce x” x? 
(a) oF meee a: 
eo) 2 3 
k—-1X x We 
(b) ee Sit got S 


(c) dmx” = 14+ x+4+ 2x? 4+ 3x +--- 


n=0 


In many examples the radius of convergence can be found easily using the 
Ratio Test. 


2 oe, < 
For the series in (a), we set u; = a and form the ratio 
i! 


gE ees Ss a ae = Hel 
|Un| (n + I)!] [xn (1 £ + 1)! 
Since (1 + 1)! = (2 + In!, 
[Mnpa) | ee 
|u| n+ 1 
Hence 
vo, et ge 
rang ee 


It follows from the Ratio Test that the series = is absolutely convergent 
=0 
for every real number x. Hence the interval of sigs So leak is the entire real 


line, and the radius of convergence is infinite. 


k 
Let uw, = (—1)*7 = for the series in (b). For every integer k > 1, we 


|" 
have |u;| = == Hence 
[Magid a bi ie Ba ae. 
Since 
k l 
eo, tee eee 


kx <3 
La 
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we obtain 


ina ‘Wes| = 
ko lu uz| 


0 k 
The Ratio Test therefore implies that the series > (1S converges 


k=0 
absolutely if |x| < 1 and diverges if |x| > 1. It follows that the endpoints 
of the interval of convergence are the numbers — 1 and 1 and that the radius 
of convergence is 1. If x = 1, the series becomes 


fo 6) ao | 
SE ee Be ee ge a ae 
k=1 


which is the convergent alternating harmonic series. On the other hand, if 
x = —1, the series becomes 


Be densty eaptooe Stenayictys 


Since 2k — 1 is always an odd integer, we have (—1)**—' = —1, and so 
. ees — | 
= | ae eee = oe = 
Bye Bek 


which wa Hence the interval of convergence of the power series 


> enh g -17_ is the half-open interval (—1, 1]. 


For the series in (c), let u, = n!x”. We then get 


|unps| — [@e + 1x? t 
ae ey RE 
It follows that 
DL, ix <D, 
lim iibival <= lim (a + 1)|x| = 


n—0 | U,| nn 0, ix = Q. 
From the Ratio Test we conclude that the series >>, m!x” converges only at 
x = 0. The radius of convergence is therefore equal to 0, and the interval 
of convergence contains the one number 0. 


A significant generalization of the definition of power series can be made 
as follows: Consider an arbitrary real number a and an infinite sequence of 
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real numbers dp, a1, do, .... For every real number x, a power series in x — a 
is defined by 


Se ae 


The power series in x studied earlier in this section are simply instances of the 
present definition for which a = 0. 

Fortunately, it is not necessary to start from the beginning again to 
develop the theory of convergence of power series in x — a. Consider the 
power series in y obtained by making the substitution x — a = y in the 
series (1). We obtain 


ie. ¢) 


> 8 = ai $a + on See (2) 
i=0 
Let J be the set of all real numbers y for which (2) converges, 1.e., the interval 
of convergence of the power series (2). Similarly, let J be the set of all real 
numbers x for which (1) converges. Since x — a = y, or equivalently, 
x = y + a, a number 6 will belong to / if and only if 6 + a belongs to J. 
Thus the set J consists of all numbers of the form 6 + a, where b belongs to J. 


Symbolically, we write 
J=I+a. 


( radius of 
convergence ‘ 
ES oe 


— —_ % B. : =z — 5 - ’ S, 
interval of interval of 


convergence of convergence of 
co oo 
> ayy! >: a;(x— a)’ 
i=0 i=0 
Figure 7 


Geometrically, therefore, the set J is obtained by translating the interval / 
along the real line a distance |a|, translating to the right if a > 0, and to the 
left if a < 0 (see Figure 7). Hence, the set J of all real numbers x for which 
Dizo a(x — a)’ converges is also an interval, and is called the interval of 
convergence of that series. The corresponding radius of convergence is equal 
to the radius of convergence of the series (2), but this time it should be 
interpreted (if it is not infinite) as the distance from the number a to the right 
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endpoint of the interval of convergence J. The interior of J is, of course, 
symmetric about the number a. 

A power series >-;2) a,(x — a)' is frequently called a power series about 
the number a. If we recall that |x — a] is geometrically equal to the distance 
between x and a, we see that the appropriate analogue of theorem (6.2) is: 


(6.3) THEOREM. Jf the power series >. 72) a;(x — a)' has radius of convergence 
p, then the series converges absolutely at every x such that |x — a| < p. Ifp 
is not infinite, the series diverges at every x such that |x — al > p. 


EXAMPLE 2. Find the radius and interval of convergence of the power series 


Di fe 4-2) -i5 72 52> 


Observe, first of all, that this is a power series about the number —2. That is, 
to put it in the standard form (1), we must write the series as 


dye - (-9)' 


Let us set u, = = Ls + 2)” and apply the Ratio Test. For every integer 
n > 0, we obtain 
Ee a ee 
|u| — (a 2)" 2 
Hence 
gq = lim [unsal _ ere 
no = ig 3 


The series is therefore absolutely convergent if g < 1, or, equivalently, if 
ix + 2| < 3, and it is divergent if g > 1, or, equivalently, if |x + 2| > 3. 
Remember that |x + 2| is the distance between x and —2. Thus the interior 
of the interval of convergence is the set of all real numbers whose distance 
from —2 is less than 3. Hence the radius of convergence is equal to 3, and 
the endpoints of the interval of convergence are the numbers —5 and lI. 
If x = —5, the series becomes 


oa pete G3) 
ee ee 


= a ee 
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which diverges. If x = 1, we also obtain a divergent series, 


wd lines = ree xe 
ie cram — alee 


+=0 


It follows that the interval of convergence of the power series 


Veet 2 
7=0 


is the open interval (—5, 1) (see Figure 8). 


radius of 
// convergens a 


= sgt af real line 
interval of convergence (—S, 1) 


Figure 8 


PROBLEMS 


1. Find the radius of convergence of each of the following power series. 


eee e): 2,8 
ae Be Ce ae 
i=l 1=0 


ender ee @) doix' 


oO : [o@) y 
oy O QiageE DR 


2. Find the interval of convergence of each of the power series in Problem 1. 
3. Is the following statement true or false: Every power series > \;_» a;x’ converges 
absolutely only in the interior of its interval of convergence? Why? 
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4. Find the radius of convergence of each of the following power series. 


ora ae: a oe . 
oa @ De e-y 

i=1 n=0 

CO Pie 2 t (oe) 2 : 
OD © VSor+n 

i= ‘i ci! 
(c) 3 ey ij ya (f) +" ‘ay okt | 
Lak +1" a (2k + 1)! 


5. Find the interval of convergence of each of the power series in Problem 4. 
lanl 


6. Prove the following: If lim = p, then the radius of convergence of the 


n—0 |a,,| 


os ie 1 1 
power series >.;-) a(x — a)’ is equal to -. (Assume that et co and that 
p 


as ) 
CO 


7. Functions Defined by Power Series. For every power series 
bs a(x ae a)’, 
i=0 


the function defined by the power series is the function f which, to every real 
number c at which the power series converges, assigns the real number f(c) 
given by 


fle) = Yi aie — ay. 


The domain of fis obviously equal to the interval of convergence of the power 
series. Speaking more casually, we say simply that the function fis defined 
by the equation 


fons Sera. 


i=0 
As an example, let f be the function defined by 


3 


PEE Dia lo-paansiagetaey stg : 
i. : ; 


This power series was studied in Example 1 of Section 6 and was shown to 
converge for all values of x. Thus the domain of the function which the 
series defines is the set of all real numbers. 
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Functions defined by power series have excellent analytic properties. 
One of the most important is the fact that every such function is differentiable 
and that its derivative is the function defined by the power series obtained by 
differentiating the original series term by term. That is, if 


f(®) = YS ax — a)! = ay + a(x — a) + a(x — a) $e, 


then 


f(x) = = jee =O =a ae a ee = 


i=1 
This is not a trivial result. To prove it, we begin with the following theorem: 
(7.1) A power series >. ;~, a(x — a)' and its derived series Y\;, ia;(x — a)’ 
have the same radius of convergence. 


In Section 6 we showed that the essential difference between the power 
series }/{, a,(x — a)’ and the corresponding series >>;2, a;x' is that the 
interval of convergence of one is obtained from that of the other by transla- 
tion. In particular, both series have the same radius of convergence. To 
prove (7.1), it is therefore sufficient (and notationally easier) to prove the 
same result for power series about the origin 0. We shall therefore prove the 
following: Jf the power series >\;~., a;x' has radius of convergence p and if the 
derived series \~;"_, ia;x'—' has radius of convergence p’, then p = p’. 


Proof. Suppose that p < p’, and let c be an arbitrary real number such that 


. oO . . - 
p<c <p’. Then the series > ;_,) aic' diverges, whereas the series 
> j-1 ia;c'—! converges absolutely. Since c is positive, 


ice) oO 

; 7—1 ; a 

C ) ae. = > axe |, 
1=1 i=] 


and it follows that the series >)7-, ia,c‘ is also absolutely convergent. How- 
ever, it is obvious that, for every positive integer /, 


laic’| < ilaic’| = |iayc’). 


The Comparison Test therefore implies that the series >;_, |a;c‘| converges, 
and this fact implies the convergence of >-;_,) a,c’, which is a contradiction. 
Hence the original assumption is false, and we conclude that 


ame (1) 


Next, suppose that p’ < p. We shall derive a contradiction from this 
assumption also, which, together with the inequality (1), proves that p’ = p. 
Let 5 and c be any two real numbers such that p’ < b < c < p. It follows 
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from the definition of p’ that the series Seri ia;b'—' diverges. Similarly, 
from the definition of p, we know that the series >-;_) a;c’ converges, and 
therefore lim a;c' = 0. Because c is positive, it follows that there exists a 


1— 00 


positive integer N such that, for every integer 7 > N, 


laic'| <c. 
But |a,c‘| = |a,\c’, and so the preceding inequality becomes |a,|c’ < c, or, 
equivalently, 
1 
Oe eae oe 


ra = 


Hence, since 4 is also positive, we obtain 


1— oe b i 
liaib' "| = ib’ “lai < oa = i(’) 
¢ 


b 
for every integer i > N. Let us set- = r. Then 0 <r < 1, and we have 
shown that e 

lia’ "| < i, for every integer i > N. 

However, it is shown in Example 3, page 508, that the series }°;_, ir’! 
converges if |r| < 1. Hence the preceding inequality and the Comparison 
Text imply that the series }-;_, |ia;b'—'| converges, and this contradicts the 
above conclusion that >~;_, ia;b'—! diverges. This completes the proof that 
po’ = p, and, as we have remarked, also proves (7.1). 


Note that Theorem (7.1) does not state that a power series 
5.) a(x — a)' and its derived series have the same interval of oe 
but only that they have the same radius of convergence. For example, 1 
Example sea page 514, the interval of convergence of the power series 


3 = _ is shown to be the half-open interval (—1, 1]. However, the 
k=1 
derived series is 


:3 Sas | —— Se oa = k—1 


k=1 k=1 


=J—x+tx—x+---, 


which does not converge for x = 1. It is a geometric series having the open 
interval of convergence (—1, 1). 

Let 1°, a;(x — a)’ be a power series, and let f and g be the two func- 
tions defined respectively by /(x) = Dy) a(x — a)’ and by g(x) = 
>, ladx — a). We have proved that there is an interval, which, with the 
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possible exception of its endpoints, is the common domain of fand g. How- 
ever, we have not yet proved that the function g is the derivative of the func- 
tion f. This fact is the content of the following theorem. 


(7.2) THEOREM. I/f the radius of convergence p of the power series 
Diz a(x — a)’ is not zero, then the function f defined by f(x) = 
Dio a(x — a)' is differentiable at every x such that |x — a| < p and 


fx) = Dia — a) 


I=1 


Proof. It is a direct consequence of the Chain Rule that if (7.2) is proved fora = 0, 
then it is true in general. We shall therefore assume that f(x) = oj) aix’. 
Let g be the function defined by g(x) = >°)_, ia;x‘—!, and let c be an arbitrary 
number such that |c| < p. We must prove that f’(c), which can be defined by 


f(x) — ©) 
7 oe 


f'tr= ima 


exists and is equal to g(c). Hence the proof is complete when we show that 
f(x) — fle 
lim (M20 _ x0) ae 7§) (2) 
t+ Tae 

Let d be an arbitrary real number such that |c| < d < p (see Figure 9). 

=-d c nae x d 
=p 0 p 
Figure 9 


Henceforth, we shall consider only values of x which lie in the closed interval 
[—d,d]. For every such x other than c, we have 


fo-fe 1 (Ss 1 Ss 3 
a (> spite Yo ax') 


oa) (5 _ “) 
d z aS : 
_ x—C 
For each integer i > 1, we apply the Mean Value Theorem to the function 
x', whose derivative is ix'—!. The conclusion is that, for each /, there exists a 


real number c; in the open interval whose endpoints are c and x such that 
x' — cl = ict “Wx = ©), Hence 
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and so 
Fi 0)-— $le)——= mt 
ay oe Da fai 


From this it follows that 


po) — St f(c) — g(c) - »» jes a 1 nat ae a 


x-— Cc 
— » igdce > _ a} 


4—=2 


For each integer i > 2, we now apply the Mean Value Theorem to the 
function x‘—!, whose derivative is (i — 1)x‘~*. We conclude that there exists 
a real — b; in the open interval whose endpoints are c and c; such that 


poe = — be, te, = cf. - 


Hence 


ee! ga = > ii — Iai “(ci — ©). 


Ke — 
i t=2 


Since |c; — c| < |x —c|, for every i, we obtain, using Theorem (5.3), 
page 509, 


= gfe) 


cae 


| f(x) — fe) ee tas 
| < |x al Zeal 1)a,b; |. (3) 


4 


Two applications of (7.1) imply that the power series >-;_. ii — Dax‘, 
which is the derived series of >~;_; ia;x'~!, also has radius of convergence 
equal to p, and it is therefore absolutely convergent for x = d. Moreover, 
'b;| < d for each i, and so 


ee ek. 1d tae | 


for every integer i > 2. It follows from the Comparison Test that 


Ye = pawl 1s We Daa a (4) 


aoe 
| 
bo 


The value of the series on the right in (4) does not depend on x, and we denote 
it by M. Combining (3) and (4), we therefore finally obtain 


2 ee, 


’ fea & 


— g(c)) < |x — c| M. 


The left side of this inequality can be made arbitrarily small by taking |x — c| 
sufficiently small. But this is precisely the meaning of the assertion in (2), 
and so the proof is finished. 
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EXAMPLE |. (a) Show that 


00 k 
Ini + x)= >> (=yi"<- ie 
k=1 


for every real number x such that |x| < 1. 


00 a 


For (a), let f be the function defined by f(x) = 2 = We have already 


shown that the domain of f is the set of all real iieebere: 1.e., the radius of 
convergence is infinite. It follows from the preceding theorem that 
0 t—1 
f(x) = > i — , for every real number x. 


41=1 
i l 
— = —-———_ > we obtain 
i! (i — 1)! 


oO ° co (ee) 
2 1X ae: Ne XxX 
mee > Soe =) ot 
t=1 l t=1 Ee k=0 Kt 


where the last equation is obtained by replacing i — 1 by k. Thus we have 
proved that 
= eee) for every real number x. 


‘ > 


o—» 


; ee: 
The function f therefore satisfies the differential equation = = yp, whose 
x 


general solution is y = ce”. Hence f(x) = ce* for some constant c. But it is 
obvious from the series which defines f that f(0) = 1. It follows that c = Il, 
and (a) is proved. 


A similar technique is used for (b). Let f(x) = 3 (— pin = . The 


domain of fis the half-open interval (—1, 1], and the adie of eee 
is. 1. lence 


Se ie. os mda Lys *x —1_k-—1 


=l—x+x—x4---, 


f'(x) 
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for |x| < 1. The latter is a geometric series with sum equal to , 


. Hence 
we have shown that is 


ee | 
f'@) = aa for every x such that |x| < 1. 


Integration yields 


dx 
fo 


Since |x| < 1, we have |1 + x| = (1 + x). From the series which defines f 
we see that f(0) = 0. Hence 


B= (OO) ind +0) = ce = OF c= &. 


It follows that f(x) = In(1 + x) for every real number x such that |x| < 1, 
and (b) is therefore established. ‘ 


i) = = Injl+ x| +c. 


Example | illustrates an soe point. The domain of the function f 
defined by f(x) = Ps (—1)*— 1” is the half-open interval (—1, 1]. On the 


other hand, the donmnii of the ae In(1 + x) is the unbounded interval 
(—1, ~). It is essential to realize that the equation 


In(l + x) = Ee Be uh 


has been shown to hold only for values of x which are interior points of the 
interval of convergence of the power series. It certainly does not hold at points 
outside the interval of convergence where the series diverges. As far as the 
endpoints of the interval are concerned, it can be proved that a function 
defined by a power series is continuous at every point of its interval of 
convergence. Hence the above equation does in fact hold for x = 1, and we 
therefore obtain the following formula for the sum of the alternating harmonic 
series: 


es 1 
in? = {f= =] 
n 2 ( a 


Let 302, a;(x — a)’ be a power series with a nonzero radius of conver- 
gence p, and let f be the function defined by the power series 


f(x) = >» a(x bess a)’, 
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for every x in the interval of convergence. By iterated applications of 
Theorem (7.2), 1.e., first to the series, then to the derived series, then to the 
derived series of the derived series, etc., we may conclude that f has deriv- 
atives of arbitrarily high order within the radius of convergence. The formula 
for the nth derivative is easily seen to be 

(7.3) fR%) = WiG- 1)---@-— at Dax — a)™, 

for every x such that |x — a| < p. 

Is it possible for a function f to be defined by two different power series 
about the same number a? The answer is no, provided the domain of f is not 
just the single number a. The reason, as the following corollary of Theo- 
rem (7.3) shows, is that the coefficients of any power series about a which 
defines f are uniquely determined by the function f. 


(7.4) If f(x) = 2) a(x — a)' and if the radius of convergence of the power 
series is not zero, then, for every integer n > 0, 


Eo 
an = oa (a). 


[By the zeroth derivative f we mean simply f itself. Hence, for n = 0, 
the conclusion is the obviously true equation ad) = f(a).] 


Proof. The radius of convergence p is not zero, and so the formula in (7.3) holds. 


“| 
Since ii — 1)--:-@—n+1) = —""___. we obtain 
(i — n)! 


a) 
Ms 5, ae ay ter, 


= Nan + (a + I)langi(x — a) + 


for every x such that |x — a| < p. Setting x = a, we obtain 


£@ = nan, 


from which the desired conclusion follows. 


PROBLEMS 
1. Let f be the function defined by 


0 t 
x 


fe) = oS 


tami 


(a) Find the radius of convergence of the power series and also the domain of f. 
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(b) Write the derived series, and find its radius of convergence directly [i.e., 
verify Theorem (7.1) for this particular series]. 
(c) Find the domain of the function defined by the derived series. 


. Let f be the function defined by f(x) = +2 = (x — 2)’, and follow the same 
1 v 


jee 
instructions as in Problem 1. 


. Let f be the function defined by f(x) = 5 (x — 2)', and follow the same 


a=) | 
instructions as in Problem 1. 


, and follow the same 


. Let f be the function defined by f(x) = a a 7 } 
, i 


instructions as in Problem 1. 
. Find the domains of the functions f and g defined by the following power series. 


es 2i— Ay AD 
(a) f(x) = CS an 24 @) a= DG 
! 0 e 


. If f and g are the two functions defined in Problem 5, show that 

(a) f’(x) = g(x) cere xy =" FO). 

. Let f and g be the two functions defined in Problem 5 (see also Problem 6). 
(a) Evaluate (0), f’(0), g(0), and g’(0). 


(b) Show that f and g are both solutions of the differential equation 
3 y 


(c) Write the general solution of the differential equation in (b), and thence, 
using the results of part (a), show that f(x) = sin x and that g(x) = cos x. 


. Show, as is claimed at the beginning of the proof of Theorem (7.2), that it is a 
direct consequence of the Chain Rule that if this theorem is proved for a = 0, 
then it is true for an arbitrary real number a. 

. Prove that every power series can be integrated, term by term. Specifically, 
prove the following two theorems. 


(a) A power series \\;~) a(x — a)' and its integrated series 


= aj i i+1 


have the same radius of convergence. 


(b) Jf the radius of convergence p of the power series >>; 9 a(x — a)! is not 
zero and if f and F are the functions defined, respectively, by 


ie.¢) [e) 


fx) = Yoade a) and F@) = Ve - a), 
t=O 


1=0 
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then 


F@y= | se9.ax + c¢. 


10. Starting from the geometric series 
2 eDs = l—-x +x =— 8 +e 
i=0 


and the results of Problem 9, show that 


arctan x = 23 aan = 


7=0 


for every x such that |x| < 1. 
11. (a) Show that the interval of convergence of the integrated series in Problem 10 
is the closed interval [—1, 1]. 


(b) True or false? 


8. Taylor Series. The subject of Section 7 was the function defined by a 
given power series. In contrast, in this section we start with a given function 
and ask whether or not there exists a power series which defines it. More 
precisely, if f is a function containing the number a in its domain, then does 
there exist a power series )) 2.) a;(x — a)’ with nonzero radius of convergence 
which defines a function equal to fon the interval of convergence of the power 
series? If the answer is yes, then the power series is uniquely determined. 


Se 
4: (ayer 


Specifically, it follows from Theorem (7.4), page 526, that a; = — 
i! 


every integer 7 — U HMence 


f@) = Og fP@ - a) 


= fla) + f@x - a) + yaa = ate, 


for every x in the interval of convergence of the power series. 
Let f be a function which has derivatives of every order at a. The power 


ss 1 oe = ; 
series ye = f(a)(x — a)' is called the Taylor series of the function f about 
pains 


the number a. The existence of this series, whose definition is motivated by 
the preceding paragraph, requires only the existence of every derivative 
f(a). However, the natural inference that the existence of the Taylor series 
for a given function implies the convergence of the series to the values of the 
function is false. In a later theorem we shall give an additional condition 
which makes the inference true. Two examples of Taylor series are the 


ee = a Ss ea 
A Ne 
(-)-1= OU 
T= (DS = OL 
UIR}IGO DIOJOIOY} OM “(]) UI UOTTIUYap dy} WOLy 
Soe Cas SOUIYM : - ieee as (53 
< = ies sOUNYM : = = (Et 


‘1— = (1),f soucqm . — = (x),f 


[O18 SOAT} 
-ALIap oy, ‘f Jo ydeis oy} UO Yyoea Jo ydeis oy} ssoduiiadns pur *] Jaquinu 
ay} jnogqe f uonMouny sy} Joy “yz [eruoud]od Jo[AR] ay] ajnduioo ‘¢ pur 


x 
ZT ‘1 ‘0 = 4 log 2 (x) f Aq pouysp uorounj 941 oq f 127 “| ATdWVXxa 


Uses" “Q = ¥ Yona 40f Muy = Mop 
sayfsijps p Jnoqgn f uoloun{f ay) oJ “J uoljouixosddp sojdv Yu ay] (18) 
:yNSor JuRJIOdWI yng [dus 9y} DAVY 
OM SHYT “(Y) yf 0} Jenba si qt yey} SMOYs “YY JeIWIOUATOd IO[ABT, dy} JO Y Je 


SATPBALIOP YI 9Y} JO UOTJeINdWIOdD Joop ‘US *** ‘QO = ¥ JABdJUT YORI IOF 
‘x Jaquinu [Bat AI9A9D IOJ 


Cy 


(Oe WD and +2 +@ =e 


0=2 


AC, oe YO) fH es = (zy 5 


u 


Aq udals “7 [eruouATOd 3y} SI v JoquNU 9y} Inoge / UOTIOUN] 94} 
0} UoleWMTxoidde JO[AB], yyw oy} USYT, “UV ‘°° “Q = 7 1989}UT AIDA9 IOJ S}SIXO 
(DO), f yey} Yons uoNouNs ve f pUL JaBdJUT BAT}RSaUUOU B 9q WU 19T :SMOTIOJ 
se sl uoluyop oy ““y Aq o}OUap ][eYys 9M YOIyM ‘(0 — X) UT [eIUOUATOd 


+) O=2 
B stuns [ered yyw ayy" (0 — X)\O) af 5 4 SOLIOS IOJABT BJO ‘stuns [elyied 


Sui Aq poyeulxoidde oq Uvd SAIIOS SHUYUI JUISIOAUOD B JO ONTRA SUL 
"LZ Woloag 
jo | gdwexq ul podopaasp (x + ][)UT JO} Salas oy} pue ,a IO} SOLS 


6cS SAJIYUAS UOTAVI [Q ‘OAS 
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These equations express the functions 7, as polynomials in x — | rather 
than as polynomials in x. The advantage of this form is that it exhibits most 
clearly the behavior of each approximation in the vicinity of the number 1. 
Each one can, of course, be expanded to get a polynomial in x. If we do this, 
we obtain 

T(x) = a 

PAX sax 4o2, 

TGQ) = x" "= 3x 3: 

T(x) = —x® + 4x? — 6x + 4. 
The graphs are shown in Figure 10. The larger the degree of the approximat- 


ing polynomial, the more closely its graph “hugs” the graph of f for values 
of x close to I. 


y-axis 
a 
PTS IO) 
y.= T,(x) 
5. ~ y = T,(x) 
X-axis 
y= 7,(%) 
y = T;(x) 


Figure 10 


The basic relation between a function f and the approximating Taylor 
polynomials 7,, will be presented in Theorem (8.3). In proving it, we shall 
use the following lemma, which is an extension of Rolle’s Theorem (see 
pages 111 and 112). 


sec. 8] TAYLOR SERIES 531 


(8.2) Let F be a function with the property that the (n + 1)st derivative 
F'*)(?) exists for every t in a closed interval [a, b], wherea < b. If 


i) F’(@)=0, fori=0,1,...,n, and 
(i) F() = 0, 


then there exists a real number c in the open interval (a,b) such that 
F&M (ec) 2% 


Proof. The idea of the proof is to apply Rolle’s Theorem over and over again, start- 
ing with 7 = O and finishing with i = n. (In checking the continuity require- 
ments of Rolle’s Theorem, remember that if a function has a derivative at a 
point, then it is continuous there.) A proof by induction on 7 proceeds as 
follows: If n = O, the result is a direct consequence of Rolle’s Theorem. 
We must next prove from the assumption that if the lemma is true forn = k, 
then it is also true for n = k + 1. Thus we assume that there exists a real 
number c in the open interval (a, b) such that F“+?(c) = 0 and shall prove 
that there exists another real number c’ in (a, 6) such that F“*°(c’) = 0. 
The hypotheses of (8.2) with m = k + 1 assure us that F“*?)(f) exists for 
every ¢ in Ja, b] and that F“*+(a) = 0. The function F“ Tt” satisfies the prem- 
ises of Rolle’s Theorem, since it is continuous on [a,c], differentiable on 
(a,c), and F&+)(q) = F@+)(c) = 0. Hence there exists a real number c’ 
in (a,c) with the property that F“*?(c’) = 0. Since (a,c) is a subset of 
(a, b), the number c’ is also in (a, b), and this completes the proof. 


We come now to the main theorem of the section: 


(8.3) TAYLOR’S THEOREM. Let f be a function with the property that the 
(n + 1)st derivative ft” (t) exists for every t in the closed interval [a, b], 
where a < b. Then there exists a real number c in the open interval (a, b) 


such that 
n 


l a v 
fH=> ft OG — 2. 
i=0 ~ 
where 
| 


Ry — (n+ 1) f2(O@ fais gyeeh 


Using the approximating Taylor polynomials, we can write the conclu- 
sion of this theorem equivalently as 


sae Stee 


a + DI f°" (ob a a. Ee 


Note that the particular value of c depends not only on the function fand the 
numbers a and 6 but also on the integer n. 
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Proof. Let the real number K be defined by the equation 
iO) = 7,0) - ko ae. 
The proof of Taylor’s Theorem is completed by showing that 


> 1 
att se 1! 


for some real number c in (a, 5). For this purpose, we define a new function 
F by setting 


K f° ey, 


FOS “TOY KE = ar, 
for every ¢ in [a, b]. From the equation which defines K, it follows at once that 
OO) TR) — 2a 


and hence that F(5) = 0. In computing the derivatives of the function F, we 
observe that any derivative of K(t — a)”*! of order less than n + 1 will 
contain a factor of t — a, and therefore 
d' : 
aa Kt — a) T = 9% fori = 0,...50. 
Since f(a) = T,(a), for every integer i = 0,...,m, [see (8.1)], we con- 
clude that 


FE’ (@)=f ah Lo Gh ae ae <i ae 
Hence, by Lemma (8.2), there exists a real number c in (a, 5) such that 
uma —e 


Finally, we compute F‘" +) (r) for an arbitrary ¢ in [a, b]. Since the degree of 
the polynomial 7, is at most n, its (n + 1)st derivative must be zero. More- 
over, the (n + 1)st derivative of K(t — a)"*! is equal to (n + 1)!K. Hence 


- FeO = fo =e es 
Letting t = c, we obtain 
0 = Fee) = f+) — (n + DK, 


1 


from which it follows that K = ————— 
a ie 


f+) (ce). This completes the proof. 


It has been assumed in the statement and proof of Taylor’s Theorem 
that a < b. However, if 6 < a, the same statement is true except that the 
(n + 1)st derivative exists in [b, a] and the number c lies in (5, a). Except 
for the obvious modifications, the proof is identical to the one given. Suppose 
now that we are given a function f such that f‘”*” exists at every point of 
some interval J containing the number a. Since Taylor’s Theorem holds for 
any choice of 6 in J other than a, we may regard 6 as the value of a variable. 
If we denote the variable by x, we have: 
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(8.4) ALTERNATIVE FORM OF TAYLOR’S THEOREM. Jf f"*? (1) exists for 
every t in an interval I containing the number a, then, for every x in I other than 
a, there exists a real number c in the open interval with endpoints a and x such 
that 


f(x) = fla) + f(x = a) + + FA = 4)" + Re, 


where 
Re = ae ele — a 


The conclusion of this theorem is frequently called Taylor’s Formula 
and R,, is called the Remainder. As before, using the notation for the ap- 
proximating Taylor polynomials, we can write the formula succinctly as 


f(x) = T,(x) + Rn. : (3) 


EXAMPLE 2. (a) Compute Taylor’s Formula with the Remainder where 
f(x) = sin x, a = 0, and 7 is arbitrary. (b) Draw the graphs of sin x and 
of the polynomials 7,,(x), forn = 1, 2, and 3. (c) Prove that, for every real 


number x, 
ss 3 5 7 


x x x 
sinx = =x- s+ 2>-s5 
& OF = ty 3! cee 
The first four derivatives are given by 
é Six = cos. x 
dx ( 
2 
: ae 
dx Ser ei Si) X, 
d’ 
ig SIX = —COS x, 
a se == SE x 
dx ; 


Thus the derivatives of sin x follow a regular cycle which repeats after every 
fourth derivation. In general, the even-order derivatives are given by 
4 jae eel sin pee A Bo a 
ca lee = Ly winx, l Nps 
and the odd-order derivatives by 
21—1 


t—l meres 
Aaa sn x.=—.(— IL cos x, ee ep Ae See 


534 INFINITE SERIES  [cHap. 9 


If we set f(x) = sin x, then 
f°’) = (—1)' sin 0 = 0, ee = oe eee 
f"@) = (—1)*" cos 0 = (— 1)", ee Ses 


Hence the nth Taylor approximation 1s the polynomial 

og i 

ii) = 22 

i=0 - 
in which the coefficient of every even power of x is zero. To handle this 
alternation, we define the integer k by the rule 
= if n 1s even 
2 >) 
n+ 1 
Pe 


k = (4) 


, if n is odd. 


It then follows that 
2i—1 


: I oe ee - = 
TQ) = 2) apo CDT ED ee pik 


[If n = O, we have the exception 7)(x) = 0.] For the remainder, we obtain 


l n n 
oe Fe ee 
n+1 

x k tT 

_——_ (—1)"cosc, if m is even, 
1)! 
78 sf (6) 

aa —1)' sinc, __ if nis odd, 


for some real number c (which depends on both x and n) such that |c| < |x]. 
The Taylor formula for sin x about the number 0 is therefore given by 
2i—1 


k 
ae Bee ae ee 
sn x = 2 | 1) (i= 1)! a igs 


where k is defined by equation (4), and the remainder R,, by (6). 
For part (b), the approximating polynomials 7), 72, and 7; can be read 
directly from equation (5) [together with (4)]. We obtain 


T(x) eer. S 
T(x) Ses : 
id= = 


Their graphs together with the graph of sin x are shown in Figure 11. 


sEC. 8] TAYLOR SERIES 535 


Figure 11 


To prove that sin x can be defined by the infinite power series given in 
(c), we must show that, for every real number x, 


sce = i 7X) 
no 
21 —1 


A »—1 
ee oa 


k0 fat 


where k is the integer defined in (4). Since sin x = T,(x) + Rn, an equiv- 
alent proposition is 

hi Fe, = -0. 
To prove the latter, we use the important fact that the absolute values of the 
functions sine and cosine are never greater than 1. Hence, in the expression 
for R,, in (6), we know that |cos c| < 1 and |sinc| < 1. It therefore follows 
from (6) that 


Lies 
oe 
Ral S (n + 1)! 
It is easy to show by the Ratio Test [see Problem 4(b), page 510] that 
n+l 
lim os = 0. Hence lim R, = 0, and we have proved that 
n> (n = 1)! oe 
ee x 
sINX =X — ar + ee 


The form of the remainder in Taylor’s Theorem provides one answer 
to the question posed at the beginning of the section, which, briefly stated, 
was: When can a given function be defined by a power series? The answer 
provided in the following theorem is obtained by a direct generalization of the 
method used to establish the convergence of the Taylor series for sin x. 
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(8.5) Let f be a function which has derivatives of every order at every point 
of an interval I containing the number a. If the derivatives are uniformly 
bounded on I, i.e., if there exists a real number B such that |f'”(t)| < B, for 
every integer n > QO and every t in I, then 


$0) = Def? @E - 47% 


7=0 


for every real number x in I. 


Proof. Since f(x) = Tn(x) + Rn [see Theorem (8.4) and formula (3)], we must 
prove that f(x) = lim 7,,(x), or, equivalently, that lim R, = 0. Generally 


speaking, the number c which appears in the expression for the remainder R,, 
will be different for each integer n and each x in J. But since the number B is a 
bound for the absolute values of a// derivatives of f everywhere in I, we have 
f+ (c)| < B. Hence 


1 3 n 
|Rn| = at! ail Cao) ae 
i Sal os Ix — al" 
However [see Problem 4(b), page 510], 
ee es 
i 


from which it follows that lim R, = 0. This completes the proof. 


nO 


It is an important fact, referred to at the beginning of the section, that the 
convergence of the Taylor series to the values of the function which defines 
it cannot be inferred from the existence of the derivatives alone. In Theo- 
rem (8.5), for example, we added the very strong hypothesis that all the 
derivatives of fare uniformly bounded on J. The following function defined 
by 


0 i x = U, 
f= d 
oe ee a 


has the property that f(x) exists for every integer n > O and every real 
number x. Moreover, it can be shown that f‘”’(0) = 0, forevery n > 0. It 
follows that the Taylor series about 0 for this function is the trivial power 
series 2, 0: x’. This series converges to 0 for every real number x, and 
does not converge to f(x), except for x = 0. 

When a Taylor polynomial or series is computed about the number zero, 
as in Example 2, there is a tradition for associating with it the name of the 
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mathematician Maclaurin instead of that of Taylor. Thus the Maclaurin 
series for a given function is a power series in x, and is simply the special 
case of the Taylor series in which a = 0. 

Suppose that, for a given n, we replace the values of a function f by those 
of the mth Taylor approximation to the function about some number a. 
How good is the resulting approximation? The answer depends on the 
interval (containing a) over which we wish to use the values of the polynomial 
T,. Since f(x) — T,(x) = Ry, the problem becomes one of finding a bound 
for |R,,| over the interval in question. 


EXAMPLE 3. (a) Compute the first three terms of the Taylor series of the 
function f(x) = (x + 1)'/3 about x = 7. That is, compute 
l 
DO =e af MR 7. 


(b) Show that 7,(x) approximates f(x) to within ; = 0.00024 (ap- 


cave 
proximately) for every x in the interval [7, 8]. 
Taking derivatives, we get 
; ] 
“x)= 2 Be ee ee eee ; 
f'@) = a +,)) 3(x + 128 
2 
’?r ——— =—§/3 nee SN. 
I°) = —§0tae 1) O(x + 19/8 
my as eee, Ln ee > gatas 
7 (x) ae a. 3 (x i 1) 2 33(x a8 ] 8/3 
Hence 
f(T) = 8 = 2, 
. 2 Snes 
fll ee cy ae 
" et i. Ses ae a 
LOE eg a 


and the polynomial approximation to f(x) called for in (a) is therefore 
= I tgs = ee 
TA(X)= 2 + qa — 7) — aggt™ i. 
For part (b), we have | f(x) — T2(x)| = |R2| and 


Re = 5 f(x - TY, 


for some number c which is between x and 7. To obtain a bound for |Ra| 
over the prescribed interval [7,8], we observe that in that interval the 
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maximum value of (x — 7) occurs when x = 8 and the maximum value of 
|f’”’| occurs when x = 7. Hence 


Ral < a Lee"D| 8 — 7P. 


since: “(P= a, > we get 
|R3| < | Zz s 5 3 5 


bia ae | ae ees 


34-28 


Hence for every x in the interval [7, 8], the difference in absolute value be- 
tween (x + 1)! and the quadratic polynomial T.(x) is less than 0.00025. 


PROBCEMS 


1. For each of the values of n indicated, compute the Taylor polynomial T,, 
which approximates the function f near the number a. 


1 

(a) ee eg 
1 

(b) i 
1 

(c) IO= a Se 


(d) f(x) =Vx+1,a = 3,n = 1, 2, and 3. 
(ec) f(x) = sinx,a = a. — 0, 1, and 2. 


2. Compute the formula, for an arbitrary nonnegative integer n, for the approxi- 
mating Taylor polynomial to the function f about the number a. 


(a) - F(x) = cos x, a =@ (br ff) = 36 a= 
3. Forn = 0,1, and 2, compute the Taylor polynomial 7,, which approximates the 


function f near 0. Draw the graphs of the three polynomials together with the 
graph of /. 


(a) f(x) =e (b) f(x) = cosx. 
4. Let p be a polynomial in x of degree <m; i.e., the function p is defined by an 
equation 


P(x) = ao + ax +++ + anx", 


and let 7, be the Taylor polynomial which approximates p near an arbitrary 
real number a. Prove, as a simple consequence of Taylor’s formula with the 
remainder, that p(x) = T,(x), for every real number x, provided n > m. 

5. For each of the values of 1 indicated, compute the approximation 7, to the 
polynomial p near the number a. 


GQ) PO) = 4 3k = 414,80 = SS ee 
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10. 


oe 
Ea: 


<— x 
. Show that sin x differs in absolute value from the approximation x — = 


Ss) pi) = 2 — 5x* + 3, a = 0, 2. =—-1,2,.and3: 
(c) p(x) 


i) te = 2 = 1 a = 1, = 2, 3, and 4. 


x* 4 3x? + x + 2,a = 0, = 3, 4, and 17. 


. Prove that, for every real number x, 


ea 27 
ix 
cos. x = > (—1) Qi)! 
x x x? 
Sr peg age 


. For each of the following functions, compute the Taylor series about a. 


x 


i=". ifa =0 (b) > fg 2 (c) arctan x, if a = 0. 


. (a) Compute the cubic Taylor polynomial p(x) which approximates the func- 


tion for values of x near the number 1. 


(b) Show that, for every x in the interval [0, 2], the approximation p(x) differs 


in absolute value from ae by less than 0.04. 
4-2 , 


= 0.025 (approximately) for every x in the interval 


by no more than 


-5-4] 


The Taylor approximation 7, to a function f about the number a is frequently 
called the best polynomial approximation of degree <n to the function f near a 
because it can be shown that 7,, is the only polynomial of degree <n with the 
property that, as x approaches a, the difference f(x) — 7;,(x) approaches zero 
faster than (x — a)”. 

Prove the following part of the above assertion: If f has continuous 
(n + 1)st derivative in an open interval containing a, then lim —— oa = 

Ia "eee 

What cubic polynomial best approximates x4 — 2x? + 3x — 3 near x = 2? 
Another statement of Taylor’s Theorem which gives a different form for the 
remainder is the following: Let f be a function with continuous (n + 1)st 
derivative at every point of the interval [a,b]. Then 


10) = fla) + flay = a) +++ + FO = 0)" 


i. SS Y ptD 4) dt. 
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(a) Using integration by parts, show that 


fs Lae, Y g0+0 (7) dt 


1 
5 fab - ay + [ CaO $C dt. 


(b) Using induction on n and the result of part (a), prove the above form of 
Taylor’s Theorem in which the remainder appears as an integral. 


13. Let f have a continuous second derivative at every point of an interval contain- 
ing the number a in its interior, and let f’(a) = 0. Show that f has a local 
maximum value at aif f’’(a) < 0, and a local minimum value at aif f’’(a) > 0. 
[Hint: Use the Taylor Formula f(x) = 7,(x) + R; and the fact that, if a 
continuous function is positive (or negative) at a, then it is positive (or negative) 
near a.]| 


CHAPTER@IO 


Geometry in the Plane 


Suppose that we are concerned with the motion of a particle as it moves 
in a plane. At any time ¢ during the motion, the position of the particle is 
given by its two coordinates, which depend on time, and may therefore be 
denoted by x(t) and y(t), respectively. The set of points traced out by the 
particle as it moves during a given interval of time is a curve. The function 
which describes the position of the particle is called a parametrization, and a 
curve described by such a function is said to be parametrized. In the first 
sections of this chapter we shall develop the mathematical theory of param- 
etrized curves, abstracting from the picture of a physical particle in motion. 
Later we shall return to this application and define the notions of velocity and 
acceleration of such particles. 

Parametrized curves represent an important generalization of the curves 
encountered thus far as the graphs of functions. As we shall see, a param- 
etrized curve is not necessarily the graph of an equation y = f(x). 


1. Parametrically Defined Curves. When we speak of the plane in this book, 
we assume, unless otherwise stated, that a pair of coordinate axes has been 
chosen. As a result, we identify the set of points in the plane with the set R’ 
of all ordered pairs of real numbers. A convenient notation for a function P 
whose domain is an interval J of real numbers and whose range is a subset of 
the plane is P: J R®. Every function P: J— R? defines two coordinate 
functions, the functions which assign to every ¢t in J the two coordinates of the 
point P(Z). If we denote the first coordinate function by f, and the second one 
by g, then they are defined by the equation 


P(t) = (f(A), 2(d), for every ¢ int, 22 


Conversely, of course, every ordered pair of real-valued functions f and g 
with an interval J as common domain defines a function P: J — R? by equa- 
tion (1). 

Since the first and second coordinates of an element of R? are usually 
the x- and y-coordinates, respectively, we may alternatively define a function 


541 
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P: I— R? by a pair of equations 
x = f (0), 
y = gt?) 


where f and g are real-valued functions with domain J. Then, for every ¢ in J, 
we have P(t) = (x, y) = (f(), g(f)). It is also common practice to denote 
the coordinate functions themselves by x and y. When this is done, we do not 
hesitate to write the equations x = x(t) and y = y(t), and the function 
P: I— R? is defined by 


P(t) = (xd), y()), for every ¢ in J. 


A function P: J — R? is said to be continuous at f) if both coordinate 
functions are continuous at fo. If the coordinate functions are denoted by x 
and y, then we define 

lim P(t) = (lim x(1), lim y(f)). 
t—tg t—to t—to 
As a result, the definition of continuity for P is entirely analogous to that for a 


real-valued function: P is continuous at fo if fp is in the domain of P and if 
lim P(t) = P(t). As before, the function P is simply said to be continuous 


if it is continuous at every number in its domain. 


A curve in the plane is by definition a subset of R? which is the range of 
some continuous function P: J] — R*. Every curve is the range of many such 
functions, and, as a result, it is necessary to choose our terminology carefully. 
We shall call a continuous function P: J — R? a parametrization of the curve 
C which is the range of P, and we shall say that C is parametrically defined 
by P: [— R?*. The points of the curve C obviously consist of the set of all 
points P(t), for every tin J. By a parametrized curve we shall mean the range 
of a specified continuous function P: [— R*. Speaking more casually, we 
shall refer to the curve defined parametrically by 


P(t) = (x(), yd), 
or, equivalently, to the curve defined parametrically by the equations 
x = 2h, 
i = yo), 


for every ¢ in some interval J which is the common domain of the continuous 
functions x and y. If ¢ is regarded as an independent variable, it is called the 
parameter of the parametrized curve. 
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EXAMPLE 1. Draw the curve defined parametrically by 
Pi = -@",.1), —-0o <t< o. 
This is, of course, also the curve defined by the equations 
ie ie a 
y=, = he al <a. 


It is plotted in Figure 1. Since the set of all points (x, y) which satisfy the 
above two equations is equal to the set of all points (x, y) such that x = y’, 
we recognize the curve as a parabola. 


TABLE 1 


Figure | 


It is worth noting that every curve which we have previously encountered 
as the graph of a continuous function f can be defined parametrically. The 
graph is the set of all points (x, y) such that x is in the domain of f and such 
that y = f(x). This set is obviously equal to the set of all points (x, y) such 
that 
t, 


FH, and ¢ is in the domain of f. 


Xx 


(2) 


- 


Hence the graph of fis defined parametrically by equations (2). 

A function P: J] — R? is differentiable at ft) if the derivatives of both 
coordinate functions exist at fo. Moreover, following the usual style, we say 
that P is a differentiable function if it is differentiable at every number in its 
domain. This terminology is also applied to parametrized curves. That is, a 
curve defined parametrically by P: J > R? is said to be differentiable at fo, 
or simply differentiable, according as P is differentiable at fo, or is a differ- 
entiable function. 


544 GEOMETRY IN THE PLANE [cuap. 10 


EXAMPLE 2. Draw and identify the curve C defined parametrically by 
P(t) = (x(), yD) = (4 cos ¢, 3 sin 2), 


for every real number f¢. If (x, y) is an arbitrary point on the curve, then 


: x = 4COEG 
pS Fom 7, 
for some value of t. Hence, ‘ = cos ¢ and 5 = sin ¢, and, consequently, 
x y : = 
Go eo! Tee i 


Thus for every point (x, y) on the curve, we have shown that 


2 
xX 


2 
ae Ri ee 
16 + 9 hi (Zé) 
The latter is an equation of the ellipse shown in Figure 2, and it follows that 
the curve C is a subset of the ellipse. Conversely, let (x, y) be an arbitrary 
point on the ellipse. Then |x| < 4, and so there exists a number f¢ such that 


Figure 2 


x = 4cost. Since cos ¢t = cos(—?f) and sint = —sin(—1f), we may choose 
t so that sin ¢ and y have the same sign. Then, solving equation (3) for y 
and setting x = 4cos ¢, we obtain 


2 2 
vs ee eee ee 
y -9(1-%)- (; 16 ) = 9a COs Tf) 
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Since y and sin ¢ have the same sign, it follows that y = 3sint. We have 
therefore proved that, if (x, y) is an arbitrary point on the ellipse, then there 
exists a real number ¢ such that 


(x, y) = (4cos t, 3 sin t) = P(t). 


That is, every point on the ellipse also lies on C. We have already shown that 
the converse is true, and we therefore conclude that the parametrized curve C 
is equal to the ellipse. 


Consider a curve C defined parametrically by a differentiable function 
P: I > R?2, and let fo be an interior point of the interval J. A typical example is 
shown in Figure 3. Generally it will not be the case that the whole curve is a 
function of x, since there may be distinct points on C with the same x-co- 
ordinate. However, it can happen that a subset of C containing the point 
P(t) is a differentiable function. Such a subset is shown in Figure 3, drawn 


P(t) = (x(t), y(t) 


Figure 3 


with a heavy line. Thus if P(t) = (x(), yO) for every ¢ in J, then there may 
exist a differentiable function f such that 


y) = f(x), (Ai) 


for every ¢ in some subinterval of J containing f in its interior. If such a func- 
tion does exist, we shall say that y is a differentiable function of x on the 
parametrized curve P(t) = (x(t), y(t)) in a neighborhood of the point P(to). 
Applying the Chain Rule to equation (4), we obtain 


y(t) = f(x) xO. 
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Hence 


(5) 


for every ¢ in the subinterval, for which x’(t) ¥ 0. If we write y = f(x) and 
use the differential notation for the derivative, formula (5) becomes 


(6 ) 


dy 
It should be apparent that f’(x(2)), or, equivalently, = at f, is equal to the 
slope of the curve C at the point P(r). 


EXAMPLE 3. Find the slope, when ¢ = a of the parametrized ellipse in 
Example 2. The parametrization is defined by the equations 


x = 4cost, 
y = 3 see 


We shall assume the analytic result that y is defined as a differentiable function 
of x in a neighborhood of the point 


VT . T 
(4c05 3-3 sin 2) 
1 . 0 \/3 3/3 
(4053-3 .sin 2) = (4-3 503° =) *) = @ = ). 


one can see by simply looking at Figure 2 that this should certainly be true 
since the curve passes smoothly through the point and, in the immediate 
vicinity of the point, does not double back on itself. We have 


Since 


dx d 
ws Hoos t = —4sint, 
dy _ Die 3 cos f, 


dt dt 
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and so 
dx a : V3 im 
eo ee 
dy = e . 2. 
=e =" = 3 cos % 3 


Hence, by formula (6), the slope is equal to 


dy 
dy ee dt t=7/3 3 ie “Bet 
GX|tanja ax Se 4/3 
At |1—1/3 


The problem of giving analytic conditions which imply that y is a differ- 
entiable function of x on a parametrized curve in the neighborhood of a 
point is akin to the problem of determining when an equation F(x, y) = c 
implicitly defines y as a differentiable function of x in a neighborhood of a 
point. As mentioned on page 81, the latter is solved by the Implicit Function 
Theorem, and the techniques needed here are similar. 

As a final example, let us consider the curve traced by a point fixed on the 
circumference of a wheel as the wheel rolls along a straight line. We take the 
x-axis for the straight line. The radius of the wheel we denote by a, and the 
point on the circumference by (x, y). If we assume that the point passes 
through the origin as the wheel rolls by to the right, then the curve is defined 
parametrically by the equations 


x = a(é — sin 6), 
y = a(l — cos 8), = =< 2 


where the parameter @ is the radian measure of the angle with vertex the 
center of the wheel, initial side the half-line pointing vertically downward, 
and terminal side the half-line through (x, y) (see Figure 4). (An alternative 


Figure 4 
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geometric interpretation of the parameter is that a@ is the coordinate of the 
point of tangency of the wheel on the x-axis.) The curve is called a cycloid. 
Note that the parametric equations are quite simple, whereas it would be 
difficult to express y as a function of x. 


PROBLEMS 


1. Draw and identify each of the curves defined by the following parametrizations. 


(a) PO) = GO -2 St= 4, 

(Pay = —- 4, V), —oe <t< ow, 

(iC) Pi) i = ee 2 Se 
(d)- PU) =r" a. —ox <t< ow, 
()ioPiice dition igtt).ci sere. Soave 


ci) Pay Sa eet, Sie, 0 Te. 
(co)... (p= Gite. OO 5 Sy EE OSE 
(h) OC) = QC sin r, 3 cosy), —~wx <r< o, 


2. Draw and identify each of the following parametrized curves. 


x=rt-1, 

(a) 
y = 2t + 3, SI OF <S OO., 
Les, 

co | 
y=t —3, —~a7 <t< ow, 
Kos 2s 

(c) - 
yo = COs f, + = ee we 
= I SeGi0, 

(d) y =‘Ziane, Mesa 5 Sena 


2 Z 


3. Each of the following parametrized curves is a function fof x. [To put it another 
way, each is the graph of an equation y = f(x).] Find f(x). 


(a) Po = G¢—1,7+4+)), — Hee P< oO, 
(b) | 
=e". —~n <f< a, 


eee a 
(c) : 
y =-3 Smit, ee ee 


id) PQ) =e. 2, —O <-¢ <hory 


SEC. 1] PARAMETRICALLY DEFINED CURVES 549 


4. For each of the following parametrizations, find an equation F(x, y) = c whose 
graph is the parametrized curve 


(a) P(t) = (¢*, 2), —o <t< ow, 
Co) 

y =e’, —o <ft< w, 

a a i a 
(c) ss = 

y e es, i ee 


[For (b) and (c), you will need in addition to the equation F(x, y) = c, 
the inequality x > 0.] 


5. For the ellipse in Example 2 parametrized by the equations x = 4 cost and 
y = 3 sinfé, interpret t geometrically. (Hint: See Figure 5.) 


y-axis 


Figure 5 


6. Sketch the curve defined by the parametrization 


x = ad — bsin8, 
y =a-— boos, S06 <a oe, 
This curve is traced by a point, fixed on a radius of a wheel of radius a at a dis- 


tance b from the center, as the wheel rolls along a straight line. There are two 
cases. 


(a) The curtate cycloid, where a > b. (Think of a point on the spoke of a 
wheel.) 


(b) The prolate cycloid, where a < b. (Think of a point on the flange of a 
railway wheel.) 
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7. For each of the following parametrized curves, assume that y is defined as a 
differentiable function of x in a neighborhood of the points indicated, and find 


d ; 
ed at the point. 
dx 


is Ba = Ors 1 ae =e 


yee 3 COS S, 
(0) y= 3 sin s, when s = 7 
Ki ee Bb 
C 
© Maker oars 
= eC 
(d) 
y = f, when 7 = In 5. 


8. Find the slope of each of the following parametrized curves at the point indicated. 


(a) Pi) = ¢ —1,2 — 3 + 3t — 1), at PC). 


x = 3 cos 7, 

(b) y = 3sint, when t = 7. 
Ce eS ds 

(c) ; 


y= +2+, at d, 3). 
(dd) 00 @=t he + Dace 
9. (a) Assuming the necessary differentiability conditions on the parametrization 
P(t) = (x(d), yd), derive a formula for ye 


(b) For the curve parametrized by the equations x = +4141 and 
d“y 


y = f+ 31, find >; when t = 1. 


2 
X 


2. Arc Length of a Parametrized Curve. The straight-line distance in the 
plane between two points P = (x;, y;) and Q = (x2, ys) is defined in Section 2 
of Chapter 1 by the formula 


distance(P, O) = V(x. — x1)? + (2 — yi). am 
In this section we shall consider the harder problem of defining distance, or 
arc length, along a parametrized curve. 
Let C be a curve parametrically defined by a continuous function 
P: I[— R?, and let a and 5b be two numbers in the interval J such thata < b. 
As we have seen in Section 1, C is the set of all points 


P(t) = (x(2), vd), for every ¢ in J. 
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Consider a partition 0 = {t,..., t,} of the closed interval [a, b] such that 
Oe oy 1, So" 5, ee, 
on Set, or Every i = 0,...,n. We shall take the number Z, 
defined by ‘ 
i, => apne Pe, P;) ee 
i=l 


as an approximation to the arc length along C from P(a) to P(6). In Figure 6, 
the number L, is the sum of the lengths of the straight-line segments joining 
the points along the curve. Using (1), we may also write 


Le = 0 Vie) — xP + DG) — YP. 


P, 


P) = P(a) 


Figure 6 


The principle which motivates the definition of arc length is the fact that 
if one partition o of [a, b] is a subset of another partition 7, then Z, is in 
general a better approximation than L,. The basic reason for this is simply 
that the finer partition 7 determines more points on the curve. As an example, 
consider Figure 7, in which L, is the sum of the lengths of the solid straight- 

P(a) 


P(b) 


<a e 


Figure 7 
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line segments, and L, is the sum of the lengths of the dashed-line segments. 
It is clear from the picture that L, is closer than L, to our intuitive idea of the 
arc length of the curve. Moreover, 


(2.1) Jf one partition o of [a, b] is a subset of another 7, then L, < L,. 


Proof. This fact is geometrically apparent from Figure 7. The argument can be 
reduced to consideration of a single triangle by the realization that, since 7 
can be obtained from o by adjoining one new point at a time, it will be suffi- 
cient to prove the result under the assumption that 7 differs from o by the 
inclusion of only one additional point, which we denote by f¢,. Let P,, = 
P(t,). For some integer i, we have t;-1 < t, < ¢;. Then the expressions for 
L, and L, are obviously the same except that the term distance(P;—1, P;), in 
L, is replaced by the sum distance(P;_1, P,.) + distance(P,,, P;) in L;. Hence 


L, — L, = distance(P;_1, P,,.) + distance(P,,, P;) 
— distance(P;_1, P;). 
It is clear from the triangle in Figure 8 that the right side of the preceding 


equation cannot be negative. We conclude that L, — L, > 0, or, equiva- 
lently, that L, > L,, and the proof is complete. 


Figure 8 


Thus partitions with more points result in approximations at least as 
large. This brings us to the definition of arc length: Let C be a curve param- 
etrized by a continuous function P: J > R?, and let aand b be two numbers 
in J with a < b. We consider the set of all real numbers L, formed from all 
partitions o of the interval [a, b]. This set, denoted by {L,}, either has an 
upper bound or it does not. The arc length of the parametrized curve C from 
P(a) to P(b) will be denoted by L,” and is defined by 


__ {the least upper bound of the set {Z,}, if an upper bound exists, 


b 
La 0, if {L,} has no upper bound. 


A curve parametrized by a continuous function P:[a, 6] — R? is said to 
be rectifiable if its arc length L,” is finite. 

The main difficulty with the above definition, like that of the definite 
integral, is that it is by no means immediately apparent how to use it to 
compute the arc lengths of even very simple rectifiable curves. We shall now 
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show that if the parametrization satisfies a simple differentiability condition, 
then the arc length 1s given by a definite integral. 


(2.2) THEOREM. Consider a parametrization defined by P(t) = (x(t), y(1)), 
for every t in an interval [a, b]. If the derivatives x' and y’ are continuous func- 
tions, then the curve C parametrized by P is rectifiable and its arc length is given 


by : 
Lb = | veut yO at 


(Note that we have used x’(t)? for [x’(4)}? and y’(z)? for [y’()]?.. This is a 
common abbreviation, which we shall not hesitate to use whenever it causes 
no ambiguity.) 


Proof. If a = b, then L,? is certainly equal to zero, as is the integral; so we assume 
that a < b. Leto = {fo,..., tn} be a partition of [a, 5] with 


Sy ee ee, Se, 
and set P; = P(t;). In each open subinterval (t;_1, ¢;) there exist, by the Mean 
Value Theorem, numbers f,; and ¢;2 such that 
x(t) — x(ti-1) = x’ — fi-1), 


y(t) — yi) = y' (titi — ti-1). 


Hence 


distance(P;_1, P:)) = Vixtt) — x(i_D2? + vt) — y(ti—vP? 
= a/ x! (ti)? i y (ti)? fy = iE), 
and so 


L, = e Vx" (ti1)? + y’ (tie)? (ts — fi_1). (3) 
nee 


The conclusion of the theorem should now seem a natural one. Since x’(f) 
and y’(f) are continuous functions, so is Vx’(t)? + y’(7)?.. We know that 
continuous functions are integrable. It is therefore very reasonable to suppose 
that, for successively finer and finer partitions, the right side of equation (3) 
approaches the integral if /x'(t)? + y(t)? dt. Tf this is so, it follows in a 
straightforward manner from (2.1) that the set {L,}, for all partitions o, 
must have the integral as a least upper bound, and the proof is then finished. 
To complete the argument, it therefore remains to prove that 


b 
lim L, = | /x'(t)? + y(t)? dt. 


ll oll—o 


We recall that the fineness of a partition o is measured by its mesh ||o||, 
defined on page 413 to be the length of a subinterval of maximum length. 


554 
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Unfortunately, the preceding equation does not follow directly from the 
theory of Riemann sums because L,, as it is given by equation (3), is not a 
Riemann sum for the function »/x’(t)? + y’(t)?. It fails to be one because, 
in each subinterval of the partition, we have chosen two numbers f;; and f;2 
instead of one. To overcome this difficulty, we shall use a theorem about 
continuous functions of two variables, whose proof, although not deep, 
requires the concept of uniform continuity and we shall omit. From equa- 
tion (3), we write the identity 


L, = 2. Vx (ti)? + y' (ta)? te — ti-1) 
iat 


+ | veqar Gat — VEC GM] — 0 
+=] 


The first expression on the right is a Riemann sum for \/x’(1)2 + y’(t)? 
relative to o, and we shall abbreviate it by R,. Next, let F be the function of 
two variables defined by 


F(t, s) = Vx’)? + y'(s)2 — Vx’? + 2, 


for every ¢ and s in the interval [a, b]. As a result, we can write the above 
expression for L, as 


L, = he tT pe Pug. tan ti-1)- 


i=] 


Hence 


kL, — Re = x Ga A wes ~ fea 


i=1 
which implies that 


Le — Rel S >) |Ftiy tee)|(ti — te—2). (4) 
taal 


The function F is continuous, and, as is obvious from its definition, F(t, t) = 0 
for every ¢ in [a, b]. As a result, it can be proved that |F(t, s)| is arbitrarily 
small provided the difference |¢ — s| is sufficiently small. This is the theorem 
which we shall assume without proof. It follows that, for any positive number 
e, there exists a positive number 6 such that, if o is any partition with mesh 
less than 6, then 

\F(ti, tis)| < €, for every i. 


Hence, by the inequality (4), we obtain 
Le — Rol < Dd) e(ti — tit) = €(b — a), 


fe | 


/ 
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for every partition o for which ||c|| < 6. Since e can be chosen so that 
e(b — a) is arbitrarily small, we may conclude that lim (L, — R,) = 0. 
The proof is now virtually finished. We write l|ol|o 


iL, = 8, +4, = £.. 


Since R, is a Riemann sum, we know that 


b 
ba R, = | V/x'(t)2 + y(t)? dt. 
ail 0 a 


Hence 
lim L, = lim R, + lim (ZL, — R,) 


l|oll—o loll—o ll oll—o 


b 
| Vx! (t)? + y’(1)? dt + 0, 


and Theorem (2.2) is proved. 


EXAMPLE |. Compute the arc length of the curve C defined parametrically by 
P(t) = (x(0), y(d), where 


~ = a(t — sin f), 
y(t) = a(l — cos 2), a> 0), 


between P(O) = (0,0) and P(27) = (27a,0). The curve C is the cycloid 
discussed at the end of Section 1 and illustrated in Figure 4. We have 


x’) = adl — cos 2), 
yy, =.4 st. 
These are obviously continuous functions, and the arc length is therefore 
given by the integral formula. We obtain 
x(t)? + y(t)? = a[(1 — cos t)? + sin2z] 
= a*{1 — 2cost + cos*t + sin?Z] 
= ait — Zcost + 1] = 2a{1 — cos f]. 


However, we have the trigonometric identities 


t ot tek 

cos i = cos 2*= = cos = — sin => 
Z z 2 

3 = S 

bese cos’ — sii * 
hea 


from which it follows that 


: t 
1 — cost = Zsin 5 
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Hence 
x!(t)” aa y(t)’ = 4q’ sin” 7 


ot. ; 
Since sin 5 is nonnegative for every ¢ in the interval [0, 27], we conclude that 


Jee a sin 5 a oe 2. 


Thus the arc length L = L)’” is given by 


h 
I 


27 21 
| V/x'(t)? + y(t)? dt = | 2a sin : dt 
0 0 


27 


—4a cos 5 = —4acos7 + 4acos0O = 8a. 
0 


Suppose that a curve is given as the graph of a continuously differentiable 
function. In more detail: Let the derivative f’ of a function f be continuous at 
every x in some interval [a, 6]. The graph of the equation y = f(x) is a curve 
which can be parametrically defined by 


x(t) = t, 
yop); aS =. 
Then x’(t) = land y’(t) = f’(t). Since x’ is a constant function, it is certainly 


continuous. Since f’ is by assumption continuous on [a, b] and since y’ = f : 
the function y’ is also continuous. Hence the arc length L,’ is given by 


b b 
L,’ = | Vx'(t? + y(t)? dt = | V1 + f(t)? dt. 


The variable of integration which appears in a definite integral is a dummy 
variable (see page 171), and we may therefore replace ¢ by x in the right 
integral. Thus, we have proved, as a corollary of Theorem (2.2), 


(2.3) If the derivative of a function f is continuous at every x in an interval 
[a, b], then the graph of y = f(x) is a rectifiable curve and its arc length L,° 
is given by 


b 
= | V1 + f'(x)? dx. 
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EXAMPLE 2. Find the arc length L of the graph of the equation y = x? from 
the point (0, 0) to the point (2, 4). The curve is the familiar parabola shown 
in Figure 9. Using the result of the preceding theorem, we have 


ref qe dx 


2 
| oe | /1 + 4x2 dx. 
0 


e y 
S === 2X, 
mnce x 


(0, 0) 


Figure 9 


This integral can be evaluated by means of the trigonometric substitution 
See or, equivalently, 2x = tan’. If x = 0, then @ = 0, and, 
similarly, if x = 2, then 6 = arctan4. For convenience we shall set 
arctan 4 = 6). The substitution yields \/1 + 4x? = 1/1 + tan26 = sec @, 
and dx = 3sec?0d6. Hence using the Change of Variable Theorem for 
Definite Integrals, we obtain 


| V1 + 4x2 dx = | sec’ dé. 
0 0 


The reduction formula on page 369 gives 


| sec (dg = ee +4 1 | sec oa 


and also on page 369 we have 


[sc 6 dé = In\sec 6 + tan 6| + c. 
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It follows that 


4 
q[sec 4) tan 0) + In|sec 6) + tan 6o)]. 


69 
0 


99 
i | sec’d d§ = ea + 4In|sec 6 + tan a 
0 


Since 0) = arctan 4, we have tan 6) = 4 and sec @ = \/1 + tan?0) = /17. 
Hence the arc length L is equal to 


L = HV/17- 44 eT + 4) 
= V17 + dIn(V/17 + 4) 
= 4.64 (approximately). 


PROBLEMS 


1. Find the arc lengths of the following parametrized curves. 


(a) ‘ f= de 
ee hae from (2, 1) to (5, 8). 
(b) wi = 7, 
y(t) = 2(2¢ + 1)*”, from (x(0), y(0)) = (0, 2) to (x(4), y(4)) = (16, 18). 


ic) Fin = TF). from P(0) to P(2). 
(d) x0) = tose, re as» 
y(@) = asin°6, from (x(0),¥(0)) = (a, 0) to (x(r/2), y(r/2)) = (0,4). 
2. A particle in motion in the plane has position equal to 
P(t) = (0 + 4,244t + 333”) 
at time ¢. How far does the particle travel along its path from time t = 0 to 
time ¢ = 1? 
3. Find the arc lengths of the graphs of each of the following functions f between 
the points (a, f(a)) and (4, f(d)). 
@y fix) = £7, a = lane = 4. 
(b) f(x) = 2? + 1)??, a = Oand b = 2. 
(c) fx) = 37,6 =O aid 
(d) f(x) = 3(@ + e*), a = —1 and b= Tf. 
4. Show that the circumference of an ellipse with the line segment joining (—a, 0) 
and (a, 0) as major axis and the line segment joining (0, —4) and (0, 4) as minor 
axis IS given by an integral 


ra V/1 + k sin26 dé. 
0 


Evaluate the constants K and & in terms of aand 5. (Do not attempt to evaluate 
the integral.) 
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5. (a) Let g be a function which is continuously differentiable on the closed in- 
terval [c,d]. Prove, as a corollary of Theorem (2.2), that the arc length 
L.¢ of the graph of the equation x = g(y) between the points (g(c), c) and 
(g(d), d@) is given by the formula 


d 
Le = | ViFeore. 


(b) Find the arc length of the graph of the equation x = 4(y? + 2)*? between 
2/2 _ 
the point ce : 0) and the point (2/6, 2). 
(c) Express as a definite integral the arc length of that part of the graph of the 
equation x = 2y — y’ for which x > 0. 


6. The coordinates of a particle in motion in the plane are given by 


x 2 : 
f = 
at time fr. What is the distance which the particle moves along its path of 
motion between the times ¢ = 0 and ¢ = 2? 
7. The same curve can be defined by more than one parametrization: 
(a) Draw the curve defined parametrically by 


xy St, 
tp eee may ee 


(b) Draw the curve defined parametrically by 


xi) = Sinai, 
y(f)t= sin st, ee ae ee ie 


(c) Compute the arc lengths from ¢t = 0 to ¢ = 1 for the parametrizations in 
(a) and (b). 


(d) Give a geometric interpretation which explains the difference between the 
arc lengths obtained for the two parametrizations. 


8. Let P:[a, b] — R? and Q:[c, d] — R° be two parametrizations of the same curve 
C such that all four coordinate functions are continuously differentiable. (A 
function is continuously differentiable if its derivative exists and is continuous at 
every number in its domain.) Then P and Q are called equivalent parametriza- 
tions of C if there exists a continuously differentiable function f with domain 
[a, b] and range [c, d] which has a continuously differentiable inverse function, 
and in addition satisfies 


(i) f(a) = cand f(b) = d, 
(ii) P(t) = O(f(1)), for every ¢ in [a, 5). 
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(a) Using the Chain Rule and the Change of Variable Theorem for Definite 
Integrals (for the latter, see page 215), prove that equivalent parametriza- 
tions assign the same arc length to C. 


(b) Show that 


Pt): = (6es7 sit #), <= 5 
O(s) = ce = Sos 1 
ea ge ee ein ee 


are equivalent parametrizations of the same curve C, and identify the curve. 
(c) Show that 
PO) = ‘ees fF Sin 7, pf =< Dr, 


and 
Os) = (cos 57, 5m 57); Oo < f 2r, 
are nonequivalent parametrizations of the circle. 


9. Prove directly from the least upper bound definition that arc length is additive, 
i.e., that £,2 + Ly° = L,°. 


3. Vectors in the Plane. A vector in the plane is an ordered pair (P, Q) of 
points in the plane. The point P is called the initial point of the vector and 
QO the terminal point. Geometrically, the vector (P, Q) will be represented as 
a directed line segment, or arrow, from P to Q, as illustrated in Figure 10. 
We shall use boldface lower-case letters to denote vectors. For example, if v 
is the vector with initial point P and terminal point Q, then v = (P, Q). 


Q 


Figure 10 


Having identified the plane with the set R? of all ordered pairs of real 
numbers, we see that a vector is determined by four real numbers: two 
coordinates of its initial point, and two of its terminal point. Let v be a 
vector with initial point P = (a, 6) and terminal point Q = (c,d). Then the 
two numbers v, and vz given by the equations 


Sa a cae f 


Vv, = d — Bb, 


(1) 
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are defined to be first and second coordinates, respectively, of the vector v in 
R?. Thus we have defined coordinates of a vector in R? as well as coordinates 
of a point in R?. The definitions are not the same, although the concepts are 
certainly related. 

If a vector v has initial point P = (a, b) and coordinates v; and ve, then 
equations (1) tell us that the terminal point Q = (c, d) is given by 


O = (a+ 1,56 + vo). 


It follows that a vector is completely determined by its initial point and its 
coordinates. Hence, another notation for a vector, which we shall use, is 


[Although it would be consistent with this notation, we shall not write 
(Uj, U2)a.5) for the vector with initial point (a, 6) and coordinates v; and v».] 
The length of a vector v = (P, Q) in R? is denoted by |v| and defined by 

lv| = distance(P, Q). 


If P = (a, b) and Q = (c, d), then the formula for the distance between two 
points implies that 


lv] = V(c — ay? + (d — 6). 


From equations (1) it follows that the coordinates of the vector v are the two 
numbers v; = c — aand v, = d — b. Hence 


(3.1) The length of any vector ¥ = (U,, U2)p is given by 


Thus the length of a vector depends only on its coordinates. 


EXAMPLE |. Find the terminal point of each of the following vectors. Draw 
each one as an arrow in the xp-plane, and compute its length. 


ty = 1 2, where F = Cl, }), 

(b) u = (4, —1)p, where P = (1, 1), 
fc) Ww = 2” 5), MuerenO. = (0, = 1), 
(d) x = (3, —4)o, where O = (0, 0). 
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We have seen that, if P = (a, 6), then the terminal point of the vector 
(U1, V2)p 1s the ordered pair (a + U1, b + v2). It follows that 


terminal point of v = (1 + 1, 1 + 2) = (, 3), 
terminal point of u= (1 + 4,1 + (—1)) = 6, 0), 
terminal point of w = (0 + (—2), -1+ 5) = (—2, 4), 
terminal point of x = (0 + 3,0 + (—4)) = @G, —4). 


y-axis 


\ X-axis 
Q 


2 Figure 11 


The vectors are drawn in Figure 11. Their respective lengths, computed from 
the formula in (3.1), are 


lv} = V2 + 22 = V5, 

jul = V4? + (—1)? = V17, 

lw] = V(—2)2 + 52 = V29, 
Ix] = 32 + (—4p = V25 = 5. 


We shall denote the set of all vectors in R? by U. For every point Pin R?, 
the subset of U consisting of all vectors with initial point P will be denoted by 
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Up. We shall now define, in each set Up, an operation of addition of vectors 
and an operation of multiplication of vectors by real numbers. 

Addition in Up is defined as follows: If u = (ui, u2)p and V = (Ui, V2)p 
are any two vectors in Up, then their sum u + v is the vector defined by 


u tv = (uy + 41, U2 + V2)p. C3) 


Note that the sum of two vectors in Up is again a vector in Up. Furthermore, 
if u is in Up and vis in Ug, then their sum is not defined unless P = Q. That 
is, the sum of two vectors is defined if and only if they have the same initial point. 
For every vector v = (vi, U2)p, we denote the vector (—v1, —v2)p by —v. 
In this way, subtraction of vectors in Up is defined by the equation 


Bu y= + (—Y); 


which implies the following companion formula to (3): - 


The unique vector in Up with both coordinates equal to Zero is called the zero 
vector and will be denoted by 0. Thus 


= (0, O)p me (RaP), 


Obviously, the equations 
v+0=Yy, 


yv—v=0 


are true for every vector v in Up. Geometrically the zero vector in Up 1S 
represented simply by the point P. Of course, there are as many different zero 
vectors as there are points in the plane, and one cannot tell from the notation 
0 to which set Up a given zero vector belongs. It is obvious that every zero 
vector has length zero. Conversely, the length of a nonzero vector must be 
positive, since at least one of its coordinates is not zero. Hence 


(3.2) A vector v is a zero vector if and only if |v| = 9. 


Geometrically, the sum u + v of two nonzero vectors u and v in Up 
is the vector in ‘Up which is a diagonal of the parallelogram which has u and v 
as sides. This is the famous Parallelogram Law and is illustrated in 
Figure 12(a). It can be verified in a straightforward way by computing the 
slopes of the various line segments, and we omit the details. Similarly, the 
vector —y is represented geometrically as a directed line segment lying in the 
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same straight line as v, but in the opposite direction, as shown in Figure 12(b). 
Moreover, the vectors v and —v have the same length, since 


IW] = Vor + vo? = V(r + (—2)? = |-vI. 


(a+u,+v,,0+u,+v,) 


atv ,oF1s 


(a HD Uy) 


P =(a,b) 


(a) The Parallelogram Law (5) The negative of a vector 


Figure 12 


The second algebraic operation in Up is defined as follows: For every 
real number a and every vector v = (U;, V2)p in Up, we define a vector ay, 
called the product of a and v, by the equation 


In traditional vector terminology, the real number a is called a scalar. Note 
that we have not defined a product of two vectors. If we compute the length 
of the vector av, we find that 


jay] = V (ani)? + (ae = Valor + 02) 
= jalV/v2 + v2 = lal lvl, 


a result which we summarize in the statement 


(3.3) |av| = |al|v|, for every real number a and every vector v. 


If v is an arbitrary nonzero vector in Up and if a # O, then the slope of 
the line segment joining P to the terminal point of v is the same as that joining 
P to the terminal point of avy. Hence P and the terminal points of v and av 
lie on the same straight line. In addition, it is easy to check that the arrows 
representing v and av are in the same or opposite direction according as a is 
positive or negative. 
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EXAMPLE 2. Let P = (2, —1), and consider the two vectors u = (1, 5)p and 
v = (2, —1)p. Compute and draw each of the following vectors in the same 
plane with uand y. 


(a) ay, 
(oy =, 
(c) u— 2v, 


(ay. ga-+ ¥). 
The computations are very simple: 


Uu == Y= Cl, 5)p - (2, =1)}p a (3, 4)p, 
—Iy = —2(2, —1)p = (—4, 2)p, 


(,5)p — 2Q, —lDp : 
= 41, je + (—4, 2)p = (— 3. Dp 


au + ¥) = 2G, Pp = Gide 


u — 2v 


Figure 13 


The directed line segments representing these vectors, as well as u and vy, 
are shown in Figure 13. The easiest way to draw them is to make a list of their 
terminal points. We recall that a vector with coordinates v; and v2 and 
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initial point P = (a,b) has a terminal point equal to (a + v1, b + v»). 
Hence 


terminal point of u = (2 + 1,—)b+ 5) = 3,4), 
terminal point of v = (2+ 2, —-1 — 1) = 4, —2), 
terminal pointofu-+v = (2+ 3,—-1+ 4) = (65,3), 
terminal point of —2v = 22 — 4, —1+ 2) = (-2, 1), 


terminal point of u— 2v. = (2 — 3, -1+ 7) = (—1,6), 
2+ 4, —I + 2) = (ee 


terminal point of 4(u + v) 


The next theorem summarizes the algebraic facts about the set Up of all 
vectors in the plane with initial point P. 


(3.4) THEOREM. For each point P in R*, vector addition and scalar multiplica- 
tion in Up have the following properties: 


(1) ASSOCIATIVITY 
u+(v¥v+w)= (u+v)+w and (ab)y = a(by). 
Gi) COMMUTATIVITY 


ut+v=v+4. 


(iii) EXISTENCE OF ADDITIVE IDENTITY 


There exists a vector 0 in Up with the property that v + 0 = vy, 
for every vector Vv in Up. 


(iv) EXISTENCE OF SUBTRACTION 


For every vector Vv in Up, there exists a vector —v in Up such that 


v + (—v) = 0. 
(v) DISTRIBUTIVITY 


a(u + v) = au+ av and (a + bv = av + by. 
(vi) EXISTENCE OF SCALAR IDENTITY 


ly = ¥, 


The proof of this theorem follows easily from the definitions of vector 
addition and scalar multiplication, from the definitions of 0 and —v, and 
from the corresponding properties of addition and multiplication of real 


SEC. 3] VECTORS IN THE PLANE 567 


numbers given on page 2. The importance of the theorem is that every 
algebraic fact about vectors can be derived from the six properties listed. In 
fact, in abstract algebra, these properties are taken as a set of axioms: An 
arbitrary set V is called a vector space and its elements are called vectors if, 
for every pair of elements u and v in V and for every real number a, an element 
u + vy and an element ay in V are defined so that conditions (i) through (vi) 
are satisfied. This definition has proved to be of enormous value in mathe- 
matics and examples of vector spaces occur over and over again. In particular, 
Theorem (3.4) asserts that, for each point P in R?, the set Up is a vector space. 


EXAMPLE 3. Let vy be a nonzero vector in R?. Then the set, which we shall 
denote by Rv, consisting of all products tv, where ¢ is a real number, is an 
example of a vector space. For, if P is the initial point of v, then v lies in 
U p, and it follows that every product tv also lies in Up. Hence the sum of any 
two vectors in Rv is defined, and, since ; 


w+ sv = ((+ Ss), 
the sum is again in Ry. Similarly, if vy is in Rv and ifs is any real number, then 
s(tv) = (st), 


and (sf)v is by definition in Rv. Thus vector addition and scalar multiplica- 
tion are defined in the set Rv. Conditions (i), (ii), (v), and (vi) are auto- 
matically satisfied because they hold in the larger set Up. Finally, conditions 
(iii) and (iv) are also satisfied, since 


0 = Ov and —y = (— Dy. 


This completes the proof that Rv is a vector space. The terminal points of all 
the vectors in Rv form the straight line containing the initial and terminal 
points of the vector v. 


PROBLEMS 


1. Find the terminal point of each of the following vectors. Draw each one as a 
directed line segment in the xy-plane, and compute its length. 


(a) v = (—3, 4p, where P = (1, 0), 
(b) u (4, —3)p, where P = (1, 0), 
(c) x = (3, 0)g, where Q = (—1, —1), 
(d) a = (44, 33)o, where O = (0, 0). 
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2. Let P = (2,1). Compute the terminal point of each of the following vectors, 
and draw each one as an arrow in the xy-plane. The vectors u and v in parts 
(b), (c), (d), and (e) are as defined in part (a). 


(a) u = (3, —2)p and v = (1, 1)p (d) 3y 
fo) ee ee (e) u-+ 3y. 
(c) u-—v 
3. Let P = (0, 1), and consider the vectors x = (2, 5)p and y = (1, 1)p. 
(a) Draw the vectors x, y, and x + y in the xy-plane. 
(b) Compute the lengths |x|, |y|, and |x + yl. 


4. True or false: If P ¥ Q, then Up and Ug are disjoint sets? 

5. Let v be a vector in the plane with initial point P, and let 6 be the angle whose 
vertex is P, whose initial side is the vector (1, 0)p, and whose terminal side is v. 
Show that 

Vv = (|u| cos 4, |u| sin 6)p. 


The angle 6 is called the direction of the vector y. 

6. In physics, the force acting on a particle located at a point P in the plane is 
represented by a vector. The length of the vector is the magnitude of the force 
(e.g., the number of pounds), and the direction of the vector is the direction of the 
force (see Problem 5). 


(a) Draw the vector representing a force of 5 pounds acting on a particle at the 


point (3, 2) in a direction of ; radians. 


(b) What are the coordinates of the force vector in (a)? 


7. If a particle located at a point P is simultaneously acted on by two forces u and v, 
then the resultant force is the vector sum u + vy. The fact that vectors are added 
geometrically by constructing a parallelogram implies a corresponding Parallelo- 
gram Law of Forces. 

Suppose that a particle at the point (1, 1) is simultaneously acted on by a 


force v of 10 pounds in the direction of radians and a force u of \/ 32 pounds 


: ; 1 
in the direction of — j radians. 


(a) Draw the parallelogram of forces, and show the resultant force. 
(b) What are the coordinates of the resultant force on the particle ? 


8. Addition and scalar multiplication are defined in the set R? of all ordered pairs 
of real numbers by the equations 


(a, b) + (¢,d) = (@ ce bS- a), 
cla, b) =a, cy. 
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Show that R? is a vector space with respect to these operations. This fact shows 
that the elements of a vector space need not necessarily be interpreted as arrows. 
The principal interpretation of R? is that of the set of points of the plane. 

9. True or false? 


(a) The set R of all real numbers is a vector space with respect to ordinary 
addition and multiplication. 


(b) The set C of all complex numbers is a vector space with respect to addition 
and multiplication by real numbers. 


(c) The set VU of all vectors in the plane is a vector space with respect to vector 
addition and scalar multiplication as defined in this section. 


4. The Derived Vector of a Parametrized Curve. Consider a function whose 
domain is a subset of the set of all real numbers and whose range is a subset 
of all vectors in the plane. If we denote this function by v, then its value at 
each number ¢ in the domain is the vector v(t). Every such vector-valued 
function vy of a real variable defines two real-valued coordinate functions 
v,; and v» as follows: For every ¢ in the domain of v, the numbers v;(¢) and 
v(t) are the first and second coordinates of the vector v(‘), respectively. 
Hence, if the initial point of v(1) is P(t), then vi(f) and v,(f) are defined by the 
equation 


v(t) = (v1(d), v2(t)) Pc. CP) 


Limits of vector-valued functions are defined in terms of limits of real- 
valued functions. Specifically, the limit of v(t), as ¢ approaches fo, will be 
denoted by lim v(t) and is defined by 


lim v(t) = (lim v,(f), lim v2(t)) tim Pw) (2) 
t— tg t— tg t— to tt 


[For the definition of lim P(t), see page 542.] There is the possibility that 


t— to 
all the vectors v(t) have the same initial point Po, i.e., that they all lie in the 


vector space Up,. If this happens, (2) reduces to the simpler equation 
lim v(t) = (lim v(t), lim vo(t))P,- 
it t— to t— to 
Let C be a curve in the plane defined by a parametrization P: J — R’. 
If the coordinate functions of P are denoted by x and y, then C is the set of all 
points 


P(t) = (x(t), yO) 


such that ¢ is in the interval /. A typical example is shown in Figure 14. 
Consider a number f) in /. If ¢ is in J and distinct from fo, then the vector 
(P(t.), P(t)) represents the change in the value of P from the point P(fo) to 
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the point P(t). Thus for a change in the value of the parameter from fy to 1, 


the scalar product 


= (Pit), PO) (3) 


P(t) 


P(to) 
Figure 14 


is the ratio of the corresponding change in the value of P to the difference 
t — to. Hence the vector (3) represents an average rate of change in position 
with respect to a change in the parameter. In analogy with the definition of 
the derivative of a real-valued function, we define the derived vector of P at fo, 
denoted by dP(to), by the equation 


PG), = lim > (P(t), P(0)). 


Since P(to) = (x(to), y(to)) and P(t) = (x(A), y(A)), the coordinate form 
of the vector (P(fo), P(1)) is given by 
(P(t), PO) = (XC — x(t), VD) — (to) Pero): 
By the definition of the scalar product, 
I _ (xO = x(t) YO = vo)\ 
reree « ( P(to), PD); = (# Lt y any, : 


and so 


dP(to) = (im x(t) — x(to) a a ee 240) 
ee, P (to) 


ae jig. - 2 =e 


Recall that the derivatives of the functions x and y at fo are by definition 


x(t) = x(to) 


7 Sieh Bag \ ee 
y'(to.) = lim YW) — y(to) ; 


Lots = 4 
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provided these limits exist. It follows that 


(4.1) The parametrization defined by P(t) = (x(2), y(t)) is differentiable at 
to if and only if the derived vector dP(to) exists. If it does exist, then 


dP(to) = (x'(to), ¥’(to)) cto) 


EXAMPLE 1. Consider the curve parametrized by 
P(t) = (x), yD) = (@ — 1, 2¢ + 1), Assit < 0. 


Compute the derived vectors of P at t) = —1, at ft = 0, and at f = 1. 
Draw the curve and the three derived vectors in the xy-plane. As a result of 
(4.1), we have 


dP(to). = (x'(to), (to) Pctoy = (2to, 2) P cto): 


Hence 
dP(—1) = (-2,2)-» and P(-1) = (0, =), 


dP(0) = (, 2)P.o) and. PO) =(— 1, }), 
OPA = (252 eis and P(1), = (0, 3). 
The terminal points of the three derived vectors are, respectively, 
OA apest Fiji { +20, 
(= Fae ts 2) = (—I, 3), 
(0+ 2,3 + 2) = @,5). 
The parametrized curve is a parabola, as can be seen by setting 
x= = L 


2¢ + 1. 


i 


ee wre 
Solving the second equation for f, we get f = i , and substituting this 


si l 2 
value in the first, we obtain x = oe — 1, or, equivalently, 


Mae Ap ty ty’. 
The latter is an equation of a parabola with vertex (—1, 1). If x = 0, then 
4 = (y — 1)%, or, equivalently, +2 = y — 1, which implies that y = —1 


or 3. The parametrized curve together with the three vectors is shown in 
Figure 15. Note that each of these vectors is tangent to the parabola. 


If a parametrization P: [> R? is differentiable at fo, then we define a 
tangent vector to the resulting parametrized curve at fo to be any scalar mul- 
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Figure 15 


tiple of the derived vector dP(f)). In particular, the derived vector itself is a 
tangent vector. The set of all tangent vectors at ¢) is a subset of Up,;,), since 
every scalar multiple of dP(to) has initial point P(t»). Moreover, 


(4.2) The set of all tangent vectors to the parametrized curve P(t) at to is a 
vector space. 


Proof. This result has nothing to do with any special properties of the derived 
vector, since the set of all scalar multiples of any vector u is a vector space. 
This result is proved, if u is nonzero, in Example 3 of Section 3. If u isa 
zero vector, the result is even simpler: The set of all scalar multiples of a zero 
vector 0 is the set having 0 as its only member, and the six conditions for a 
vector space are trivially satisfied. This completes the argument. 


Consider a parametrization defined by P(t) = (x(‘), y(t)), which is 
differentiable at to and for which the derived vector dP(fo) is nonzero. If we 
sét x(t) = -d; and Ga) = d5,-then 


dP(to) =< (d1, A») P(ty)s 


where not both coordinates d; and d, are zero. The set of all tangent vectors 
at fo 1s the set of all scalar multiples 


sdP(to) = (sd, Sd2) P(t6)s 
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where s is any real number. If P(to) = (a, 5), then the terminal point of 
sdP(to) is equal to 
(sd, + a, sd, + b). 


Hence the set of all terminal points of tangent vectors at fy is the set of all 
points (x, y) such that 
x= say a a, 


4 
Sd» — b, 


= 


where s is any real number and d, and d, are not both zero. It is easy to verify 
that this set is a straight line (see Problem 4). We conclude that if the derived 
vector dP(to) exists and is nonzero, then the set of all terminal points of the 
tangent vectors at ty to the curve parametrized by P is a straight line. It 1s 
called the tangent line to the parametrized curve at fo. 


EXAMPLE 2. Consider the ellipse defined parametrically by - 


P(t) = (x(t), y(t) = (4 cos ¢, 2 sin 2), 


for every real number ¢. Compute the derived vector at fo = ; , and draw 
it and the ellipse in the xy-plane. In addition, write an equation for the 
tangent line at fo = - , and draw the tangent line in the figure. The derived 
vector is easily computed: 

dP(to) = (x'(to), ¥'(to)) Pcto) = (—4 SIN fo, 2 COS fo) P49) 


me (-4 sin 2 > 2 cos z) = (—2, V3)P cto)’ 
0 


where 


jga eo (4 COS 7 »2 sin =) = (22V3, 1). 
The terminal point of the derived vector is therefore equal to 
Qe 2 4/3). 
The parametrization P can also be written in terms of the equations 
xX =" COs, 
goon as ee 


from which it follows that 


X 9 ; 
—— + as cos ¢ + sin’t = 1. 
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Hence every point (x, y) on the parametrized curve satisfies the equation 


2 
xX 


2 
Bd 


Conversely, it can be shown (as in Example 2, page 544) that any ordered 
pair (x, y) which satisfies (5) also lies on the parametrized curve. We recognize 
(5) as an equation of the ellipse shown in Figure 16. The derived vector 


dP(to) and the tangent line at : are also shown in the figure. 


y-axis 


tangent line 


derived vector at = 


P(t) = (4 cos tf, 2 sin ft) 


Figure 16 


If s is an arbitrary real number, then the scalar product sdP(fto) in this 
example is the vector 


sdP(to) ie Woe, V3) P to) a (=2s, V3 S) P(t): 
The terminal point of this vector, since P(to) = (2\/3, 1), is the point 
(+25 4+ 2/3, V352% ad. 
Hence the tangent line at 2 is parametrically defined by the equations 


x= —2s+ 2/3, 


= (6) 
ye V/3e+d , =e eo a. 
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Solving the first of these for s, we obtain s = ie i . Substitution in 
the second then yields % 
ets a V3 
3 
y=- a x+4. (7) 


Thus any point on the tangent line satisfies (7), and it is easy to verify that, 
for any x and y which satisfy (7), there is a unique s such that equations (6) 
hold. We conclude that (7) is an equation of the tangent line. 


It is important to know that the ideas introduced in this section are 
consistent with related concepts developed earlier. For example, consider a 
differentiable parametrization defined by 


P(t) = (x(t), y(0)), for every ¢ in some interval J. 


Suppose that, for some fy in J, there exists a differentiable function f such that 


70) = (OD). 


for every t in some subinterval of J containing fy in its interior. This situation 
was described in Section 1 and was illustrated in Figure 3 (page 545). If 
such a function f exists, we say that y is a differentiable function of x on the 
parametrized curve P(t) in a neighborhood of P(to). Formulas (5) and (6), 
page 546, assert that, for every ¢ in the subinterval, 


= f'(x()) = ay 


provided x’(t) ¥ 0. Hence Mu Z is the slope of the line tangent to the graph 
of f at the point x) 


(x(), f(x)) = MO, xD) = PO. 


Moreover, in the vicinity of P(fo), the graph of f is the curve parametrized 
by P. At every ¢ in the subinterval, the derived vector of P is equal to 


aP@) = (2/10, y'(t)) Pay. 
This vector is, by definition, a tangent vector to the parametrized curve. Its 
initial point is P(t) = (x(t), y()) and its terminal point is 
O(t) = (x() + x’, WD) + yO). 


The slope of the line segment joining these two points is given by 


_o@+y@) -— yO _y@, 
ne) oo oO 
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provided x’(t) # 0. We conclude that the concept of tangency, as defined in 
terms of the derived vector to a parametrized curve, is consistent with the 
earlier notion, defined in terms of the derivative. 


PROBLEMS 


1. For each of the following parametrizations and values of fy), compute P(fo) and 
the derived vector dP(to). Draw the parametrized curve and each of the tangent 
vectors dP(t,) to the curve. 


fa) PO = Gy) = — LP), we ee 
to = —1, t) = 0, and f = 2. 

G2?) =o @sy@) =k IprcsNp omeceaem: 
to = —1, to = 0, and &% = 1. 

(c) P(t) = @¢ — 1,7), i Gl F< oe, 
fo ==.0j 09. 1p addy. 22. 

(d) PO) = @, y) = (49, = 003 <TD s 


to = O and fp = In?2. 
(e)-.P(i). =. Grc0s Bi. sin. th mf es 


TT T 
fo = O74 Sand | = — 
4 2 


f) PO = (0,7), «eee Ses 
to = —1, % = 0, and 4.= 2. 

(g) P(t) = (@¢t — 1, 2r + 4), oo at. Se Se Sees 
fo = —1,% = 0, and % = 1. 


2. For each of the following parametrizations P(t) = (x(n), y(2), find the derived 
vector dP() for an arbitrary value of ¢ in the domain. Draw the vectors dP(0), 
dP(1), and dP(2) in the xy-plane. 


xt) =: 1 
y™ = *, —l<.t <3, 


x(t) = $(e' + e—, 

y@) = 3’ —e), -0 <t<o., 
x =e 

yy = SOT es 


» =f+tyret i, 


ri 
oe —O <p ae, 
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3. The cycloid shown in Figure 4 is defined by a parametrization P(@) = (x, y) in 
which 


= a(@ — sin 8), 

y = a(1 — cos 8), —-~7 <cA< ow, 
Compute the derived vector dP(@). Sketch the curve, and draw the tangent 
vectors dP(0), dP (:) , dP(r), and dP(2r). 

4. Prove that the curve defined parametrically by the equations 


x = sd, + a, 
y sd, + b, —o <s <Q, 


where not both d; and d, are zero, is a straight line. (Note: Check the definition 
of a straight line given on page 39.) 

5. Converse of Problem 4: Prove that, if L is a straight line in R’, then it can be 
defined by a parametrization P(s) = (x, y) for which 


x= Sd; + 6, 
y = sd. + BD, —-o <s < Om, 


and not both d; and dz are zero. 

6. For each of the following parametrizations P(t) = (x(0), y(t)) and values of fo, 
compute the derived vector dP(to). Draw the parametrized curve, the tangent 
line at fo, and write an equation in x and y of the tangent line. 


(a) Pi) = (®+1,t4+1),-2 <t < ©, and % = 2. 

(b) Pf) = (®@+1,t4+ 1, -—~ <t< w,andt = 0 

(c) P(t) = (et), —2© <t < ~,and f = In2. 

(d) P(t) = ({t 
7. Let P be the parametrization defined by P(t) = (#°, $0’), for every real number f. 


,t), —~ <t < wo,and ft = 0. 


(a) Write an equation in x and y of the tangent line at ¢ = 2. 

(b) Describe the vector space of tangent vectors at ¢ = 2 and at ¢ = 0. 
8. Let f be a real-valued function which is differentiable at a. 

(a) Write an equation of the line tangent to the graph of f at (a, f (a)). 


(b) Consider the parametrization 


P(t) = (2, f(0). 


Compute the derived vector dP(a), and write an equation of the tangent 
line to the parametrized curve at a. 
9. Let P:[a, b] > R? be a parametrization for which the derivatives x’ and y’ of the 
coordinate functions are continuous. Prove that the arc length of the curve 
parametrized by P is given by 


b 
La’ = | \dP(t)| dt. 
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5. Vector Velocity and Acceleration. In this section we shall consider the 
motion of a particle in the plane during an interval of time. We shall assume 
that the particle moves without jumping. As a result, if P is the function 
which associates to every instant of time in the interval the corresponding 
position of the particle in the plane, then P is continuous; i.e., it is a param- 
etrization. The points P(t) trace out the parametrized curve over which the 
particle moves. 

Velocity is a vector concept which combines two ingredients: the 
number which measures how fast the particle is moving, and the direction 
of the motion. If the position of a particle during an interval of time J is 
described by a differentiable parametrization P: J —> R?, then the velocity 
of the particle at any time ¢ during the interval will be denoted by v(f) 
and defined to be the derived vector of P at t. Thus 


v(t) = dP(?). (i) 


Since the derived vector is a tangent vector, the velocity is also one. Specifi- 
cally, v(¢) is a tangent vector at ¢ to the parametrized curve defined by P. If 
we write P(t) = (x(2), y(4)), then it follows from the formula for the derived 
vector [see (4.1), page 571] that the velocity vector is given by 


v(t) = (228), y')) Pa. (2) 


The speed of the particle at time ¢ is defined to be the length |v(1)| of the 
velocity vector. The equation for the length of a vector in terms of its co- 
ordinates [see (3.1), page 561] implies that the speed is equal to 


Iv(t)| = Vx'(t)? + y(t). (3) 


EXAMPLE |. A particle moves in the plane during the time interval [0, 2], 
and its position at any time f¢ in this interval is given by 


P(t) = (x4), yD) = (cos rf, sin rf). 


Assume that time is measured in seconds and that the unit of distance in the 
plane is 1 foot. 


(a) Identify and draw the curve traced out by the particle, and describe 
its motion during the interval [0, 2]. 
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(b) Compute the position, velocity, and speed of the particle at tf = 0, 
t = 4,¢ = l,andt = 3. Showthese four positions, and draw the correspond- 
ing velocity vectors in the figure in (a). 


(c) How does the speed of the particle depend on time during the entire 
interval of motion? 


The parametrized curve over which the particle moves is the set of all 
points (x, y) such that 


c = cos rf, 


yi Sek, = t= 2. 
Hence the coordinates of every point (x, y) on the curve satisfy 
x2 + y? = (cos rf)? + (sin w??)? = 1. a 


The equation x? + y? = 1 is the familiar equation of the circle C with radius 
1 and center the origin, and the particle therefore moves on this circle. In 
accordance with the definition of the functions sine and cosine in Chapter 6, 
the quantity rZ? is the arc length along C in the counterclockwise direction 
from the point (1, 0) to the point P(t) = (x, y). As ¢ increases from 0 to 2, 
the values of rf? increase monotonically from 0 to 47, which is twice the 
circumference of the circle. We conclude that the particle starts from (1, 0) 
at time ¢ = 0, moves counterclockwise around the circle as time increases, 
and at ¢ = 2 has gone completely around twice and has come back to its 
starting position at P(2) = (cos 47, sin 47) = (1, 0). The curve of motion, 
1.e., the circle C, is shown in Figure 17. 

Since P(t) = (cos zf?, sin rt”), the position of the particle at each of the 
four values of ¢ given in (b) is easily computed: 


P(O) = (cos 0, sin 0) = (1, 0), 

PS) =(cos 7-4, sin x 4) = (Y , ¥?) ; 

P(1) = (os 7, sin z) = (—1, 0), 

P(3) = (os 7: Z, sina: 3) = (. 3) 
The velocity vector is 


v(t) = (X'(), VO) Pw 


(—2rt sin rf?, 2rt COS ml”) py). 


2rt(—sin rf?, COS rl?) py). (4) 


v(t) 
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P(t) = (cost, sin zt?) 


v(1) 


Figure 17 


Hence 
v0) = 0 = (0,0) po, 


- T r\/2 
v(5) = 293 (—sin 7 COS le = (—1, I)pa); 


v(1) = 2n(—sin z, cos 7) pay = 27(0, —1) Pca, 


v(3) = 203 (— sin Cos 2) = me Fe 1, 1) p(3/2). 
+ - P/2) 


The speed is by definition the length of the velocity vector. If (5, c)p is any 
vector, and a any real number, then the length of the scalar product a(b, c)p 
is given by 

la(b, c)p| = jal? + 2. 


Thus the four speeds are 


|v(O)| = O feet per second, 


lv(3)| = = 2 Ty + 12 = 7 feet per second, 


SEc. 5]: VECTOR, VELOCITY AND) ACCELERATION 581 


Wy] = S2Y2 (SEF TE = 34 feet per second. 


Since each of the velocity vectors is tangent to the curve at its initial point 
and since we know their lengths, they can be drawn without difficulty (see 
Figure 17). 

From the preceding computations it appears that the speed of the 
particle increases as time goes on. By computing the speed |v(s)| for an arbi- 
trary ¢ in the interval [0, 2], we can see that this inference is correct. Using 
equation (4) we get 


lv(1)| = 2ntV/(—sin rf)? + (cos rf?)? = 2xt, 


which shows that the speed increases linearly with time over the interval. At 
t = QO, the particle is at rest, and 2 seconds later, at ¢ = 2, its speed has 
increased to 4m feet per second. 


The motion of a particle along a straight line was studied in Section 3 
of Chapter 2 and again in Section 8 of Chapter 4. When the motion is 
restricted to a straight line, which for convenience we may take to be the 
x-axis, then the velocity vector has only one nonzero coordinate, x’(7). In 
this case velocity may be identified with x’(t), and it is not necessary to 
consider it as a vector. In our earlier treatments x’(t) was defined to be the 
velocity and it was denoted by v(t). The distance on the line which the particle 
moves during the time interval [a, b] was defined by the formula 


distance 


5 b 
“= | |o(d)| dt G3) 


(see page 232). We shall show that this definition is consistent with the more 
sophisticated notions of vector velocity and arc length of parametrized curves, 
which we are studying in this chapter. Consider a particle in the plane whose 
position is given by a parametrization P: [a, b] — R?. By the distance which 
the particle moves along the curve parametrized by P during the time interval 
from t = ato t = b we shall mean the arc length L,°. Let 


P(t) = (x(), yO), for every ¢ such that a < ¢ < b. 


We shall assume that the derivatives x’ and y’ exist and are continuous on 
[a, b]. From Theorem (2.2), page 553, it follows that 


b 
La = | VOR +O ar 
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The speed of the particle at any ¢ in [a, 5] is given by 


Hence the distance traveled by the particle along the parametrized curve from 
t= atot = bis equal to 


b 
(5.1) i ="f lv(t)| dt. 


Formula (5.1) is the generalization of the distance formula (5) from 
rectilinear to curvilinear motion. 


EXAMPLE 2. A Steel ball is rolling on a plane during an interval from ¢ = 0 
to t = 4 seconds. It has an x-coordinate of velocity which is constant and 
— equal to 2 feet per second. Its y-coordinate of velocity is $¢ feet per second, 
for every ¢ in the interval. (a) Write a definite integral equal to the distance 
(in feet) which the ball rolls during the interval from t = O tot = 4 seconds. 
(b) Identify and draw the curve on which the ball rolls. 

The coordinates of the velocity vector v(t) are x’(t) and y’(t). Hence 


(6) 


bole 
~ 
Ne 


0O<t< 4. 


By = 2, 
YAO. 


It follows at once from (5.1) that the distance which the ball rolls is equal to 


4 
oe | Ae 


r 
| s/16 + 22 dt. 
0 


This answers part (a). Using a table of integrals or integration by trigono- 
metric substitution, one can obtain 


4 

[ VieF eat = avi + sin + v9) 
0 

18.3 (approximately). 


Hence the distance the ball rolls is half this quantity, approximately 9.2 feet. 
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A parametrization which defines the position of the ball may be found 
by integrating the functions x’ and y’. From equations (6), we get 


x(t) oe a Cy 
Z 
yi) = F+e, pee. 


Nothing in the statement of the problem specifies the position of the ball at 
t = Q, so, for simplicity, we shall choose it to be the origin. This choice is 
equivalent to setting c; = c. = 0. It follows that the parametrized curve in 
which the ball rolls is the set of all points (x, y) such that 


Kom 


r 
y=qr OS1<4, 


x 
From the first equation, we get tf = 5: Hence the two equations together 


with the inequality are equivalent to 
Z 
es 


EF 
y 16 = xs = = 


The graph of this equation is the parabola shown in Figure 18, and the curve 
over which the ball rolls is that portion of the parabola indicated by the 
heavy line. 


Figure 18 


We next consider what is meant by the acceleration of a moving particle. 
The intuitive idea is that acceleration is the rate of change of the velocity 
vector. To be more precise: Let the position of the particle during an interval 
of time / be given by a differentiable parametrization P: J — R?, and let fo 
be in J. If ¢ is a number in / distinct from fo, then the velocity vectors v(f0) 
and v(t) are tangent vectors with initial points P(fo) and P(t), respectively, as 
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illustrated in Figure 19. In defining the acceleration at fo, we should like to 


form the scalar product of and the difference v(t) — v(to), and to 


— fo 
take the limit of this product as tf approaches fo. The difficulty is that, since 
the points P(t) and P(fo) are usually distinct, the difference v(t) — v(t) is 
generally not defined. (Recall that two vectors can be added or subtracted 
if and only if they have the same initial point.) It is for this reason that, 
before defining acceleration, we introduce the notion of parallel translation 
of vectors in R?. 


P(t) v(t) 


Figure 19 


Let Po be an arbitrary point in R®. We shall define a function Tp, 
whose domain is the set U of all vectors in R? and whose range is the vector 
Space Up, of all vectors with initial point Po. The definition is as follows: 
For every vector u in U, the value Tp,(u) is the vector with the same co- 
ordinates as u, but with initial point Py. Thus 


Geometrically, the vector Tp,(u) is obtained from u by moving the arrow 
representing the vector u parallel to itself until its initial point coincides with 
Py. The process is illustrated in Figure 20, and we call the function Tp, the 
Operation of parallel translation of vectors to the point Po. 


Tp, (u) 
Py 


Figure 20 
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We can now define the acceleration of a moving particle. As before, let 
the position be defined by the differentiable parametrization P: [— R*. We 
consider fo in J, and set P(to) = Po. Then the acceleration of the particle at 
ty is the vector a(fo) defined by 


a(to) = a ~> [Tpo(v(t)) — v(to)]. ee. 


Thus, acceleration, like velocity, is a vector. 


We can derive a simple formula for acceleration in terms of the co- 
ordinate functions of P. Let P(t) = (x(#), y(t)), as usual. Then 


V(to.) = (X'(t0), Y' (to) ) Pct) 
v(t) = (XO, YO) Pw, ; 
aoe, = P(f,), then 
Tp,(v(t)) = (XO, ¥'O) Pao) 
It follows that 
Tp (v(t) = v(to) = (X(t) — x'(to), YO) = ¥'(t0)) Pateds 


and thence that 


= ITro(v() = v(t] = es = XH), vO See). 


p= [5 Lo ty 
Hence 
a(fo) = lim | ——— 7, Tov) — v(to)] 
t— to 
ae a ls eae: (to) ; lim y(t) ee a) : 
13% ed * iai5 Back « P(t) 


If the two limits which are the coordinates of the preceding vector exist, they 
are by definition equal to the second derivatives x’’(fo) and y’’(fo), respectively. 
It follows that 


(5.2) If P(t) = (x(t), y@)), then the acceleration vector a(t) exists if and only 
if the second derivatives x''(ty) and y'’(to) exist. If they do exist, then 


a(to) = (x''(t0), y' (to) ) Pcty): 


EXAMPLE 3. A particle is moving with constant speed & in a fixed circle of 
radius a. Show that, at any time ¢ during the interval of motion, the accelera- 
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k? 
tion vector a(t) has constant length equal to — and always points directly 
a 


toward the center of the circle (see Figure 21). 


y-axis 


Vl 
jawyj= 


Figure 21 


We shall take the center of the circle to be the origin in the xy-plane. 
The position of the particle can then be defined by a parametrization 


P(t) = (x(), y@) = (x, y) for which 


ose Ces 
(3) 


y = asiny, 


and wu is some function of ¢ having as domain the interval of time of the mo- 
tion. To be specific, we shall assume that 0 is in the domain, and that, when 
t = O, the particle is at the point (a, 0) on the circle. Hence u = 0 when 
t = 0. We shall make the analytic assumption that the second derivative 
u’’(t) exists, for every ¢ in the interval, and it follows that x’’(t) and y’’(r) also 
exist. Differentiating with respect to ¢ in equations (8), we obtain 


x’ = —au’ sin u, 
: Se 

y= ail’ COS UW. 

Thus the speed of the particle is 

lv(t)| = Vx? + yy? = Vatu'2(sintu + cos2u) = alu’|, 
pas = 
which is assumed to be the constant k. Hence |u’| = —. Since w’ is con- 
a 


tinuous and has constant positive absolute value, it is either always positive 
or always negative (depending on whether the particle is moving counter- 
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clockwise or clockwise). We shall assume the former and conclude that 


= | 7 
u’ = —. Integrating, we obtain 
a 
k 
u=-tt+e. 


a 


Since uw = 0, when ¢ = O, it follows that 


k 
u= —t. 
a 


Substituting this value back into equations (9), we have 


F kk pe 
x’ = —a-sin-t = —ksin—-t, 
a a a 


", 
| 
S 
| 
QO 
° 
N 
| 
“= 
I 
> 
Q 
fo) 
N 
| 
_™~ 
| 


Hence 
k? 2 
se = 
ee FS OS 
a a a 
2 2 
fo SS I BY, 
a a a 
or, equivalently, 
Z i 
acca .* ae 
Be Elin pM ORM ose yeas 
2 2 
a 
Se ee 


We know from (5.2) that the acceleration vector is given by 


a(t) = (X,Y) Pw: 


Hence 
ke ke? 
a(t) = |(— —x,- = 
©) ( a° a° ”) 
kK? 
So Ae 


Since P(t) = (x, y), the terminal point of the vector (—x, —y)pi) 1s the 
point (0, 0). Thus the acceleration vector a(/) is a positive scalar multiple of 
the vector with initial point P(t) and terminal point the origin. This proves 
that a(f) is always pointing directly toward the center of the circle. The length 
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of the acceleration vector is easily computed from the preceding equation. 


We get 


ke _ k? e AAR 
k? - 

= ae -qQ=- 
a~ a 


This completes the problem. The acceleration in this example is called centrip- 
etal acceleration, and the force acting on the particle necessary to provide 
this acceleration is the centripetal force. In the case of a planet moving in 
orbit, the force is the force of gravity. 


PROBLEMS 


1. (a) Draw each of the following vectors. 


(b) 


(i) (0, 5)p,, where Po = (—1, 1). 
(ii) 4, ~—De,, where 1. = 11.8 
(ii) (1, 3)p,, where Py = (1, 1). 

(iv) (=2Z, =p, where Ps = We: 


Let Py) = (—1, 1), and compute and draw the translated vectors Tp, (u), 


where u is taken to be each of the four vectors in (a). 


2. A particle moves in the plane during the time interval from ¢ = 0 to ¢ = 2 
seconds. Its position at any time during this interval is given by the parametriza- 


tion 


PP) = Got = 0, 


where it will be assumed that the unit of distance in the plane is 1 foot. 


(a) 


(d) 


(e) 


Identify and draw the curve which the particle traces out during its interval 
of motion. 


Compute the velocity vector v(t). Find the position, velocity, and speed at 
t= 0, ¢ = 1, and t = 2. Show these positions and draw the velocity 
vectors in the figure in part (a). 


Compute the acceleration a(t). Find the times and corresponding positions 
(if any) when the acceleration and velocity vectors are perpendicular to each 
other. 


Write a definite integral equal to the distance (in feet) which the particle 
moves during the interval from ¢ = 0 to f = 2 seconds. 


Evaluate the integral in (d). 
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3. An object is dropped from an airplane which is flying in a straight line over level 


ground at a constant speed of 800 feet per second and at an altitude of 10,000 feet. 
The horizontal coordinate of the velocity of the object is constant and equal in 
magnitude to the speed of the plane. The vertical coordinate of velocity is ini- 
tially zero. However, the vertical component of acceleration (due to gravity) 
is —32 feet per second per second. (These data are realistic only if we neglect air 
resistance, the curvature of the earth, etc.) 


(a) Define a parametrization P(t) = (x(a), (2)) which gives the position of the 
particle at time ¢. Assume that the object was dropped when t = 0 and 
that P(O) = (0,0). Compute the velocity and acceleration vectors v(t) and 


a(f). 
(b) How long does it take the object to fall to the ground? 
(c) Identify and draw the curve in which the object falls. 


(d) Express the distance traveled along the curve as a definite integral. 


. Consider a particle in motion in the plane from ¢ = 0 to t = 4 seconds. Its 
position at any time during this interval is given by 


P= Gia ie = 27, —. 2°), 
where it is assumed that the unit of distance in the plane is 1 foot. 
(a) Draw the curve in which the particle moves during the interval. 


(b) Compute the velocity v(7) and the speed |v(7)|.. What are the minimum and 
maximum speeds, and at what times are they attained ? 


(c) Describe the vector space of tangent vectors to the parametrized curve at 
r=), and also at ¢ = 2. 


(d) Compute the distance traveled by the particle during the motion. 


. The position of a particle in motion in the plane is defined by the parametrization 
P(t) = eo y) , ie Py; wert < t *~ Z. 
(a) Draw the curve traced out by the particle during the interval [—2, 2]. 


(b) Compute the velocity vector v(t). Find the position, velocity, and speed at 
t = —2,¢t = 0,¢ = 1, and¢ = 2. Indicate these positions and draw the 
velocity vectors in the figure in (a). 


(c) Compute the acceleration vector a(t). Determine the four specific vectors 
a(— 2), a(0), a(1), and a(2), and draw them in the figure in (a). 


6. The position of a particle in the plane is defined by the parametrization 


a cos kt, 
b sin kt, =< ft << e., 


X 


iy 
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where a, b, and k are positive constants anda > b. 

(a) Identify and draw the curve in which the particle moves. 

(b) Prove that the particle is never at rest. 

(c) Show that the acceleration vector a(t) always points directly toward the 
origin. 


7. Prove that parallel translation has the following properties: 


(a) |Tp,(w| = |ul for every vector u. 
(b) Ifu is any vector in Up,, then Tp,(u) = u. 


(c) IfOis any zero vector, then 7p,(0) is also a zero vector. 


8. Starting at ¢ = 0, a stone at the end of a string is whirled around in a fixed 
circle of radius a at an ever-increasing speed equal to kt for some positive constant 
k. The tension in the string is equal to mla(r)|, where m is the mass of the stone 
and |a(#)| is the length of the acceleration vector. Suppose the string breaks when 
the tension exceeds some value 7. Compute, in terms of the constants a, k, m, 
and 7, the moment when the string breaks. 


6. Polar Coordinates. Since the set R® of all ordered pairs of real numbers 
has been identified with the set of all points in the plane, every point is 
uniquely determined by its x- and y-coordinates. An alternative way of 
specifying points in the plane is the following: To every ordered pair (r, 6) 
of real numbers, we assign the point P = (x, y) in R? defined by 


ee. 


The pair (*, 9) is called a pair of polar coordinates of the point P = (x, y). 
In giving the geometric interpretation of polar coordinates of a point, 
we distinguish three separate possibilities: 


Case 1. r > 0. Then ris the distance between P = (x, y) and the origin, 
since 


distance(P, (0,0)) = J/x? + y? = vV/r?2-cos7d + r? sin2d 


I 
> 
to 
Foe 
Q 
oe) 
% 
D 
+ 
— 
bo 
D 
w~ 
I 
rl 


moh 
The number 6 is the radian measure of the angle which has its vertex at the 
origin, its initial side the positive x-axis, and its terminal side the line segment 
joining the origin to P. An example is shown in Figure 22. 
Case 2. r = 0. Then P is the origin regardless of the value of 6, since 
P = {x y) = cos, Fang) = ©, 0). 
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Figure 22 


Case 3. r <0. In this case the point P = (x, y) is symmetric with 
respect to the origin to the point with polar coordinates (|r|, 0). This fact is 
illustrated in Figure 23. To prove it, we first observe that 


—rcos(@ + 7) = —rcosé@cos7 + rsin@sinz = rcos 86, 
—rsin(@ + 7) = —rsin@cos7 — rcos@sinz = rsin 6. 


y-axis 


(|r|cos @, |r|sin 6) 


P = (x,y) 
= (r cos 6, r sin 6), if r<0O 


Figure 23 


In the present situation r is negative. Hence —r is positive, and |r| = —r. 
The preceding equations therefore imply 


rcos @ = |r| cos(@ + 7), 
|r| sin(@ + 7). 


r sin 6 
Thus 
P= {x y) = cos’, fs) 
(\r| cos(@ + 7), |r| sin(@+ 7)), ifr <0, 


and this is precisely what is asserted above. 
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The major difference between polar coordinates and the familiar x- and 
y-coordinates is that if a given point P = (x, y) has one pair of polar co- 
ordinates (7, 6), then it has infinitely many: If n is any integer, then 


rcos@ = rcos(@ + 27n), 
rsin@ = rsin (@ + 2rn), 


and it therefore follows that, for every integer n, the ordered pair (r, 6 + 27n) 
is a pair of polar coordinates for the one point P = (x, y) = (rcos 6, r sin 8). 
In addition, as shown in Case 3, we have 


—rcos(@ + 7) = rcos 8, 
—rsin(@é+ 7) = rsin @. 


Hence (—r, 6 + 7) is also a pair of polar coordinates of P. We conclude 
that there is no such thing as the polar coordinates of a point. 


y-axis 


X-axis 
Q = (cos @, sin 6) 


P =ey) 
= (r cos 6, r sin 6) 


Figure 24 


Of course, it is also important to realize that every point P = (x, y) has 
at least one pair of polar coordinates (and thence infinitely many). This is 
not hard to show. We set r = Vx? + y?. If r = 0, then x = y = 0 and 
P is the origin. In this case, (r, 6) = (0, 6) is a pair of polar coordinates of P 
for any choice of 6. If r > O, then P is not the origin, and we let Q be the 
point on the unit circle (defined by x? + y? = 1) which lies on the half-line 
emanating from the origin and passing through P (see Figure 24). From our 
definition of the functions sine and cosine, we know that Q = (cos @, sin 6) 
for some number 6. It follows that 


P = (rcos 6, rsin 6), 


and so (r, @) is a pair of polar coordinates of P. 
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EXAMPLE |. Determine the x- and y-coordinates of the points with the 
following polar coordinates, and plot the points in the plane. 


Py: (r, 0) = (2 =) 


Ps: (78) = 


op ae eB aa 


If (r, 6) is a pair of polar coordinates of a point P, then the x- and y-co- 
ordinates of P are given by 


(x, Y= (1 Coss), 


Hence, for each of the above, we get 


Pi = (x,y) = (205% -2 sin z) = 6/3. 1), 


ey) = C08 a > —2 sin *) 4/3, — 1), 


ape ee) 


2 cos 7» —2 sin =) = = (V3, —1), 


= (— V3, oo), 


TV 
cos —?2 sin & 


107 107 
3 


3 cos ~ > 3 sin 


on reel? ui dMs ; 
COS i) 5: 5 


cos 7, Osinz) = (0, 0). 


E 
n= ( 
A 
ee, y) = (2 cos TT. 2 sin zz) 
-(-2 
n= ( 
¢ 


Py = (xX y) = 
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The points are plotted in Figure 25. Note that P; = P, even though the polar 
coordinates defining them are different. Although we have found the rec- 
tangular coordinates of each point, these are not necessary for plotting. For 


example, the point P; with polar coordinates (2 4 is most easily plotted by 
drawing from the origin the line segment of length 2, which makes an angle 


eee ne 
of 6 radians with the positive x-axis. 


y-axis 


Figure 25 


We next study curves in the plane defined by equations in polar co- 
ordinates. Let F be a real-valued function of two real variables. The set of 
all points in the plane whose polar coordinates satisfy the equation 


will be called the graph in polar coordinates, or simply, the polar graph of the 
equation. More formally: The polar graph of equation (2) is the set of all 
points P = (x, y) for which there exists a pair (7, @) such that F(r, 6) = 0 
and 

= Ptose, 

. = 7 Sno. 


Frequently, r is explicitly defined as a function of 6. This means that 
there is a real-valued function f of one real variable, and we consider the 


equation 
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The polar graph of this equation is that of the equation r — f(@) = 0, which 
is a special case of equation (2) in the preceding paragraph. It follows that 
the polar graph of r = f(6) is the set of all points (x, y) such that 


x= f (0) cds 0, 
y = f(6) sin 6, 


for every 6 in the domain of f. If fis a continuous function with domain an 
interval of real numbers, then equations (4) constitute a parametrization, 
and so the polar graph of r = f(@) is a parametrized curve. 


(4) 


EXAMPLE 2. Identify and draw the curve defined by the equation 
r = 4cos@ oa 


in polar coordinates. By the curve defined by an equation r = f(@) in 
polar coordinates we mean, of course, the polar graph of r = f(6). A partial 
list of values of 6 and r which satisfy equation (5) is given in Figure 26, 
and the points which have these pairs as polar coordinates are plotted. 
Since the cosine is an even function, i.e., since cos(— 6) = cos 8, it follows that 
the resulting curve is symmetric about the x-axis. The periodicity of the 
cosine implies that we may limit values of @ to an interval of length 27. 
However, we can do considerably better than that. The fact that 


cos(@é + 7) = —cos @ 


implies that if r = 4 cos 6, then —r = 4cos(@ + 7). But we have already 
observed that (7, 6) and (—r, 6 + 7) are polar coordinates of the same point. 
It follows that all the points of the curve will be included even if the values of 


6 are limited to the interval - = ‘|. Finally, in view of the symmetry 


soe 


about the x-axis, it is sufficient in plotting points to consider only values in 


It appears from Figure 26 that the polar graph of r = 4 cos @ is the circle 
of radius 2 with center at the point P = (2, 0). This can be verified as follows: 
An equation with the same polar graph is 


p= Ay cos 0. (6) 


oe -. = 7; COs 0, p= Pane, end x* + y? = -, the pelar graph of (6) 
is the same as the graph of the equation 


x> + y? = 4x 
in rectangular coordinates. This equation is equivalent to 


RAS Ee = a ey = Ft, 
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7 

0 

7 

6 
7 
4 

7 

> 
7 

2 


Figure 26 


which is the same as 
(x — 2? + y? = 2°. 


The latter is the standard form of the equation of the circle of radius 2 with 
center at (2, O). 


EXAMPLE 3. A parabola is by definition the set of all points in the plane which 
are equidistant from a fixed line and a fixed point not on the line (see page 136). 
The line is called the directrix, and the point the focus. Find an equation in 
polar coordinates of the parabola whose focus is the origin and whose direc- 
trix is the vertical line cutting the x-axis in the point (— 1, 0). The parabola is 
drawn in Figure 27. 

An equation of a curve in polar coordinates means an equation 
F(r, 0) = O whose polar graph is the given curve. In the present example, 
let (r, 6), with r > 0, be a pair of polar coordinates of an arbitrary point 
on the parabola. Then +¢ is the distance from the point to the focus, and, as 
can be seen from the figure, the distance from the point to the directrix 1s 
1+ rcos 6. Hence the geometric condition which defines the parabola is 
expressed in polar coordinates by the equation 


ee er ee 


which is equivalent to 


rl cass 


and thence to 


Co 
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directrix 


Figure 27 : 


Conversely, if r and @ satisfy equation (7), it is clear that they are the polar 
coordinates of a point which satisfies the conditions for lying on the parabola. 


EXAMPLE 4. Draw the curve defined by the equation 
P= ee, =<. <= Oo, 


in polar coordinates, where a is some positive constant. The polar graph of 
this equation is called an Archimedean spiral. If 6 = 0, thenr = a:O = OQ, 
and we conclude that the origin is a point on the curve. As @ increases from 
zero, so does r, and it follows that a spiral is traced out in the counter- 
clockwise direction. This part of the curve is drawn with a solid line in 
Figure 28. Thus the curve drawn with the solid line is the polar graph of the 
equation 
Pe: a ae. 


For negative 6, the polar graph of r = aé is obtained by reflecting the graph 
for positive 6 about the y-axis. This part of the curve, indicated by a dashed 
line in the figure, is a spiral in the clockwise direction. One way to verify this 
assertion of symmetry about the y-axis is to write the equations which define 
the curve parametrically. They are 


x= 1Ccost = at.cas#. 


y = * Sine = we sin-6, te Po, 
That is, we have a parametrization 


P(0) = (x(@), y(0)) = (a8 cos 8, aé sin 6), 
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Archimedean spiral 
polar graph of 
r= ag a>0 


Figure 28 


for every real number 6. Since the functions cosine and sine are, respectively, 
even and odd, we obtain 


x(— 6) = a(—@) cos(— 6) = —aécos 6 = —x(8), 
y(— 0) = a(—8) sin(— 0) = —aé(—sin 6) = y(@). 
It follows that the point P(—#@) = (—x(8), y(@)) is situated symmetrically 


across the y-axis from the point P(@) = (x(6), y(@)), and thus the curve is 
symmetric about the y-axis. 


EXAMPLE 5. Draw the curve, called a lemniscate, defined by the equation 


Pe Ye coes28; air 
in polar coordinates. 
We first observe that the polar graph of this equation is symmetric about 
the origin; i.e., if the pair (7, 6) satisfies the equation, then so does (—r, 6). 
This fact is a consequence of the equations 


yp? = 20 COe 2 ae 


In addition, if (7, 6) satisfies the equation, then so does (7, —6@), since the 
cosine is an even function. Thus the polar graph is also symmetric about the 
x-axis. It follows from these observations of symmetry that the entire curve 
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is obtained from that part which lies in the first quadrant of the xy-plane 
(those points for which x > 0 and y > 0) by reflecting about both the 
x-axis and the y-axis. Moreover, any point on the curve in the first quadrant 


has polar coordinates for which r > 0 and O < 6 < = Finally, if (r, 6) 


satisfies the equation, then 
2 


cos 29 = ~— > 0. 
2a? 


However, for values of @ in the interval 0, "| , cos 28 is nonnegative only if 
<0 < i . We conclude that the entire curve is obtained by symmetry 
from those points which have polar coordinates (7, 6) with r > 0 and 
0 =0.= zi . A partial list of such pairs is given in Table 2, and the cor- 


responding points are shown on the curve in Figure 29, 


y-axis 


X-axis 


lemniscate 
Figure 29 
TABLE 2 
6 je Bates 56 Approximate 
value 
0 V2 |a| 1.4]al 
ie = \/3 ee? — 
ee ae = o 
15 vo al ( ) V3 al 3\a 
aN 12 S 
: V/2 |al (3) = V2 |al 1.2|a| 
5 | Mela Gy” ="al al 
TV 
ao : 
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PROBLEMS 


1. (a) For each of the following values of 6, find the value of r such that 
y= Sond: 8 
Ne We T T 
age Ya Gata ee 
(b) Plot the seven points with the polar coordinates (r, #) found in part (a). 


(c) Draw and identify the curve defined by the equation r = 4 sin @ in polar 
coordinates. 


2. (a) For each of the following values of 6, find the value of r such that 
r = 2(1 + cos 6): 
2 IF 


SSS 3 Tere 3 # | 


No 1 
Langs pt alee 


4 


(b) Plot the seven points with the polar coordinates (r, 6), found in part (a). 


(c) What symmetry property is possessed by the curve defined by the equation 
r = 21 + cos 8) in polar coordinates ? 


(d) Draw the curve in part (c). 
3. Using a figure and the geometric interpretation of polar coordinates, show that 


r= is an equation in polar coordinates of the vertical line cutting the 


cos 
x-axis in the point (5, 0). 

4. Using a figure and the geometric interpretation of polar coordinates, find an 
equation in polar coordinates of the horizontal line cutting the y-axis in the 
point (0, 5). 

5. Assume the well-known fact that, if one side of a triangle inscribed in a circle 
is a diameter, then the triangle is a right triangle. Using this fact and the geo- 
metric interpretation of polar coordinates, show that cos 6 = = is an equation 

‘ a 
of the circle which passes through the origin and has radius a > 0 and center 
on the x-axis. 

6. Identify and draw the polar graphs of the two equations 


Ce ee (b) g= 7 


7. Consider the curves defined by each of the following equations in polar co- 
ordinates. Write each curve as the graph of an equation in x- and y-coordinates. 
Identify and draw the curve in the xy-plane. 

(a) rcos@ = —2 (De r= S —- 

~~ gin. = 2 cas 6 


(6b) sin? = 24 (e). v= a isee Example 3) 
1 — cos @ 


ic) r= —#cos? ()-+=3 
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3 1 
ee aren — (Ros See 
410) 2 — /3cos 6 


8. Let f be a real-valued function of a real variable. Prove that: 


(a) If fis an even function, then the polar graph of the equation r = /(6) is 
symmetric about the x-axis. 


(b) If fis an odd function, then the polar graph of the equation r = /(@) is 
symmetric about the y-axis. 


9. Let F be a real-valued function of two real variables. Prove that the polar 
graph of the equation F(r?, 6) = 0 is symmetric about the origin. 
10. Draw the curve defined by each of the following equations in polar coordinates 
(the number a is an arbitrary positive constant). 


(a) r = a(l + cos 8) (a cardioid). 
(b) r = a(2 + cos 8) (a limacon). 
(c) r = a(S + cos 8) (a limacon). 


(d) +r? = 2a’sin 26 (a lemniscate). 
(e) ré = 2 (a hyperbolic spiral). 


11. Consider the Archimedean spiral defined by the equation r = aé and discussed 
in Example 4. Describe the space of tangent vectors to this curve at #6 = 0, 


and also at 0 = ; 
12. (a) Show that the equations y = 4cos x and y* = 4y cos x are not equivalent. 


(b) In spite of part (a), the polar graphs of r = 4 cos 6 and of r? = 4r cos 6 
are the same. Explain. 


13. (a) Iffis a real-valued function of a real variable, prove that the polar graph 
of the equation r = f(sin 8) is symmetric about the y-axis. 


(b) Draw the curve (a cardioid) defined by the equation r = 2(1 + sin @) in 
polar coordinates. 


(c) Draw the curve (a limacon) defined by the equation r = 1 + 2sin@ in 
polar coordinates. 


7. Area and Arc Length in Polar Coordinates. This section is divided into 
two parts. In the first, which is the longer of the two, we shall study the prob- 
lem of finding the areas of regions bounded by curves defined by equations in 
polar coordinates. To solve this problem, an integral formula for area in 
polar coordinates will be derived. The second part is concerned with the 
computation of the arc lengths of polar curves by applying the methods 
developed in Section 2. 
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Let f be a continuous function which contains the closed interval [a, 5] 
in its domain. We have already observed that the polar graph of the equation 


r = f(6), (¥) 


where 6 takes on all values in the interval [a, 5], is the parametrized curve 
defined by the equations 


x6) = rcost = FC) cos @, 
yO) = rsind = 7(0) sing, G5. Tp, 


(2) 


For the area problem, we shall assume to begin with that the interval 
[a, b] has length no greater than 27, 1.e., that 


b— as 2a, (33 
and also that fis nonnegative on [a, 5]: 
f(@) > 0, — for every number @ in [a, 5]. (4) 


Let R be the subset of the plane consisting of all points which have polar 
coordinates (7, 6) such thata < 6 < bandO < r < f(6). An example is the 
shaded region R shown in Figure 30. The problem is to compute the area of 
R. The effect of the two assumptions (3) and (4) is that every point of R has 
precisely one pair of polar coordinates (r, #) with a < @ < b (except, if 
b — a = 2r, for those points of R along the line defined by 6 = a). 


y-axis 


Figure 30 
To derive a formula for the area of R, we consider an arbitrary partition 
o = {4,...,0,\ of [a, b] with the property that 
a=) < 0% **s 2G, =a. 


For each ij = 1,...,”, let m; and M; be, respectively, the minimum and 
maximum values of the function fin the subinterval [@,;_1, 6;]. In addition, 
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let R; be the subset of R consisting of all points with polar coordinates 
(r, @) such that 0;_1 < 6 < 6;and0 <r < f(8), as illustrated in Figure 31. 
It follows from the preceding paragraph that except for their boundaries the 


sets R;,..., R, are pairwise disjoint. Hence 
area(R) = area(R,) + ++: + area(R,). (4s) 
y-axis 


r= f@) 


x-axis 


Figure 31 


Each set R; is contained in a sector of the circle of radius M; and center the 
origin with angle equal to 6; — 6;_1 radians, and it contains a sector of the 
circle of radius m; with the same center and the same angle. Since any sector 
of a circle of radius p and angle a radians has area equal to 3p’, we conclude 
that foe : 

4m; (0; — 0:1) < area(R;) < 5M; (0; — 9-1), 
for eachi = 1,...,n. Adding inequalities and using equation (5), we get 


n ee n 2 
Xu i, (6; — 0;1) < area(R) < 2a = (6; — 0;_1). 
MP? 

2 


n m2 n 
However, . cs (6; — 0;_1) and > (6; — 0;_1) are, respectively, the 
Gaal ra | 


os 


“ 


f relative to the partition o@ (see page 


lower and upper sums for the function 


165). Denoting them by L, (4) and U, (5) , respectively, we have proved 
that 


L, (4) Shed RLU: (S). (6) 
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9 


“= 


for every partition o of [a, b]. Since f is continuous, so is fe and every 


function which is continuous on a closed bounded interval is integrable over 
that interval [see Theorem (5.1), page 199]. Hence the function f is inte- 


grable over [a, 5], and it therefore follows immediately from the inequalities 
(6) and the definition of integrability on page 168 that 
b 2 


area(R) = | oe 


J a 


Summarizing, we have proved: 


(7.1) If the function f is continuous and nonnegative at every point of the closed 
interval [a, b] and if b — a < 2r, then the area of the region R bounded by the 
polar graphs of the equations r = f(@), 6 = a, and @ = b is given by 

b 


“b 
area(R) = 4 | f(0)'de = 4 | r'dé. 


EXAMPLE |. The curve defined by the equation r = | + cos @ in polar co- 
ordinates, and drawn in Figure 32, is a cardioid. Compute the area of the 


r= 1+cos@ 


cardioid 


Figure 32 


region R which it bounds. Since this curve is symmetric about the x-axis, it is 
sufficient (but in this example no easier) to find the area of that part of R 
lying on or above the x-axis and to multiply the result by 2. The function f 
defined by 

f@) = ik-+.cos 6; . -0 < @ <5, 
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is both continuous and nonnegative. It follows from (7.1) that 


area(R) = 2 Bl (1 + cos nd 
0 


| (1 + 2cos@ + cos’6) dé 
Jat) 


I [1 + 2cos 6+ 4(1 + cos 24)] dé 
0 


= | (2 + 2cos 6 + 4 cos 26) dé 
0 


(26 + 2sin 6 + 4sin 26), 


poleo 


TT. 


2 


If f is negative on the interval [a, 5], the integral xf” (0)? dé is also equal 
to an area. Specifically, let us assume that f is continuous on [a, b], that 
b — a < 2n, and that f(@) < 0 for every @ in [a,.b]. Let R be the set of 
all points which have polar coordinates (r,@) such thata <6<) and 
f(@) < r < 0 (see Figure 33). Then the following formula is still valid: 


b 
(7.2) area(R) = | f (6) dé. 


=p = f(@)< 0 for 
a<@<b 


Figure 33 


Proof. Let g be the function defined by g(@) = —/(@), and let S be the set of all 
points with polar coordinates (r, 6) such that a < é =< band 0: < r = 20). 
The set S is symmetric about the origin to the set R, and we therefore con- 
clude that 


area(R) = area(S). 
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But, by (7.1), 


b b 
area(S) = 4 | g(0) do = 4 | [— f(0)I'do 


we) 
= | (0) dd, 


which completes the proof. 


If the function f can take on both positive and negative values in the 
interval [a, b] or if b — a > 2r (or both), then the integral 4 Ja Ff (6)? dé will 
in general give the sum of the areas of nondisjoint (i.e., overlapping) regions. 
It is frequently necessary to subdivide the interval [a, b] into subintervals 


9 


“= 


and to compute the integrals of f 
a desired area. 


over these subintervals separately to find 


limagon 
r=1+2sin@ 


Figure 34 


EXAMPLE 2. The polar graph of the equation r = 1 + 2 sin @ is the limacon 
shown in Figure 34. The function f defined by f(@) = 1 + 2 sin @ satisfies 
the inequalities 


a2 


JQS0 ft eS 
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Let R and S be, respectively, the regions bounded by the outer and inner 
loops of the curve, as shown in the figure. Then 


Tr /6 

area(R) = | (1 + 2 sin 0)°d9, CT) 
—1 /6 
—T /6 

areaS) = | (1 + 2 sin 6)°do. (8) 
—5r/6 


If we integrate 4/(9)? from 0 to 2z, the result will be equal to the area of R 
plus the area of S. That is, we will pick up the area of S twice and get 


21 
area(R) + area(S) = 3] (1 + 2 sin 6)dé. (9) 
0 


The consistency of equations (7), (8), and (9) can be checked as follows: 
From (7), (8), and the additivity of the definite integral, we get 


—7/6 Tr /6 
area(S) + area(R) = 1 | (1 + 2sin 6)°do + | (i= Kein nao 
/6 /6 


=—— Oi —TT 


Tr /6 
a | (1 + 2 sin 6)°d6. 


—od7 /6 


Since the function (1 + 2 sin 6)? has period 27, its definite integral over every 
interval of length 27 will be the same. In particular, 


Tr /6 
area(S) + area(R) = | (1 + 2 sin 6)°do 
a 


—oT 


27 
| (1 + 2 sin 6)°d6, 
0 


in agreement with (9). Evaluation of the integrals is left to the reader. The 
results are 


Tn / 6 
area(R) = 4 | (1 + 2sin 6)'do = 20 + 8V3, 
/6 


—T /6 
areal S) = | (1 + 2sin 0)'d@ = x — 34/3. 

J —i7/6 
It follows that the area of the region between the two loops of the limagon 
is equal to the difference, + 3\/3. 
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EXAMPLE 3. Find the area A of the region bounded by the positive y-axis 
5 
and the Archimedean spiral r = a6 (a > 0), whereO < 6 < os The region, 


shown in Figure 35, is the union of two subsets R, and Ry. The set R,; consists 
of all points with polar coordinates (7, @) which satisfy the inequalities 
O<6< 27 and0 <r < a; ie., it is the region bounded by the positive 
x-axis and that part of the spiral for which 0 < @ < 27. We find 


| 
bole 
. 
bo 
ES) 
~ 
= 
DS 
II 
bole 
a 
bo 
4 
S 
qQy 
bw 
= 


area(R,) 


Archimedean spiral 
r= age 6,a>0 


Figure 35 


The set R, consists of all points with polar coordinates (7, #) which satisfy 
5 

the inequalities 27 < 6 < se and a(@. = 29). 7 =< ae (eee eure 35). 

This region can be equivalently described as that bounded by the lines 6 = 0 


and @ = : and the two polar curves: 


y= ae, 


| 
Q 
ps 
Dp 
~{- 
NO 
4 
saa 


a) ee a 


Ni3y 
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Since 0 < r,(6) < r.(6) for every 6 on the interval [o, | , the area of Rez is 
obviously given by the formula 


a | 2 
area(R2) = 4 | (ey — "1 00 
0 


oT [2 2 
=i | [a°(@ + 2)” — a’d’]dé 
0 


2 
a 


nw /2 
— i (470 + 4x°)do 
0 


2 
9 1/2 2 2 
= “sede = 2 seg 
= Jaa (; 4. “) 27a (5 -+- ) 
ae - 


2 3 
Sa a 


We conclude that 
A = area(R,;) + area(R2) 


2 3 2 3 2 3 
4a’ eR ps DL 


2 Se ee 


An alternative way of finding the answer is to realize that the integral 


am /2 57 / 2 
| r'dé = aI ade 


is equal to the area A except for the fact that it counts twice the area bounded 


by the lines 6 = 0 and 6 = : and the curve r = a@forO0 < @< ; . Hence 


br 2 r/2 
A= 1] a ode — | a’6°d0 
0 a0) 


9 
vo 


ela a 
12 


we also obtain 


The second topic of this section is the computation of the arc length of a 
curve defined by an equation in polar coordinates. No new methods are 
needed, since the problem is simply a special case of the more general one of 
finding the arc length of a parametrized curve. As noted in the second 
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paragraph of this section, if fis a continuous function containing the interval 
[a, 6] in its domain, then the polar graph of the equation 


Fr = 7), with 2: = ‘Py, 6, 


is a parametrized curve [see equations (2)]. Specifically, the curve is the range 
of the parametrization P: [a, b] — R? defined by 


P(@) = (x(6), y(4)) = (f(8) cos 4, f(8) sin 6), 


for every @ in [a, b]. We shall make the assumption that the derivative f’ 
is a continuous function on [a, 6], and this implies that the derivatives x’ 
and y’ are also continuous. It then follows directly from Theorem (2.2), 
page 553, that the arc length of the curve from P(a) to P(d) is given by 


D 

Ly,’ = | /x'(0)2 + y'(0)2 do. 

Since . 
x’(0) = f’(6) cos 6 — f(@) sin 4, 
y'(0) = f'() sin 6 + f(6) cos 6, 

we find that 

x'(6)? + y’(6)? = f'(0)°cos?6 — 2f'(6) f(6) sin 6 cos 6 + f(@)?sin?é 
+ f’(@)?sin?é + 2f'(6) f(8) sin 8 cos 6 + f(@)?cos?6 
FOS? + FO)": 


Thus we obtain the following integral formula for the arc length L,° of the 
polar graph of the equation r = f(@), in which a < 6 < Bb: 


b 
(73) —_ | Vf + fy do. 


d, , 
Alternatively, if we set r = f(@) in the formula and write f’(@é) = : , we have 


(7a) 


EXAMPLE 4. Find the arc length of the cardioid defined in polar coordinates 
by the equation r = | + cos 6. This curve is shown in Figure 32, and the 
area of the region which it bounds is computed in Example 1. We have 


—_ = —sin 0, 


r’ = (1 + cos 6)’ = 1 + 2cos 6 + cos’#. 
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9 dr : 
weed (“) 


The trigonometric identity 


Hence 


1 + 2cos 6+ cos’6 + sin’é 


2(1 + cos 6). 


a6 = 4 
cos 5 = 5 ( + cos 6) 


implies that 


and it follows from the integral formula (7.3’) that the arc length Z of the 
cardioid is given by 


2a a 2a 
| x [4cos* Sas = 2 
0 2 0 


However, because the cardioid is symmetric about the x-axis, we conclude 


that 
L= 2| a{4eos" § ao = a] 
0 Z 0 


6 
— > Oand so 
Z 


dé. 


COs 
2 


dé. 


COS : 
Z 


1 =< 9 = =x, then cos 


L is equal to : 


6 6 
COS 4 = COS = Hence the arc length 


T 


= 8. 


0 


quan gy — 
i af cos 5 dé = 8 sin 5 


PROBLEMS 


1. In each of the following, draw the curve defined by the equation r = f(@) in 
polar coordinates. Show the region R bounded by the curve and the lines 
6 = aand 6 = 5b, and compute its area. 


(a) r = 4c0s 6,4 = Oand b = =. 

(b) r = 3011 + cos @), a = Oand b = z. 

() r= 3(1 + sin§),a = Oandb =~. 
Zz qT 7” 

(d) SS 
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2. For each of the following equations r = f(@) and pairs of numbers a and 5, 
draw the region R consisting of all points with polar coordinates (r, 6) such that 
a<6@< band0 <r < f(@). Compute area(R). 


(a) “FF =43m6, a =O and b= = 
. 4 a 30 
(b) Pe Sa ee 


ic)’ r =-20-a = 7 and B= 2a. 


1 1 
= ———__————_»q = Nand db = =: 
wages 2 cos 6 + 3 sin 6 : = Z 
(e-) r=v/2 cos 26 ,a =O and b = i . (See Example 5, Section 6.) 


3. Identify and draw the curve defined by the equation r = 4sin@ in polar 
coordinates, and show the region R bounded by the curve. Is it true in this case 
that 


«Qa 


area(R) = | r do? 


0 
Explain your answer. 


4. Each of the following curves, defined by an equation r = f(@) in polar co- 
ordinates, bounds a region R in the plane. Draw the curve and find the area 
of R. 


(a) r= ad +cos@é)a>0 (d) r= 2+ cosé 
(b) r= atl + sin@),a > 0 (c) + = 45080 
ic) f= 5 (fi) r= i—4 cos 0. 
1 
5. The curve defined by the equation r = —————— in polar coordinates is a 
1 + cos 8@ 


parabola similar to the one discussed in Example 3, Section 6. 
(a) Draw the parabola, and show the region R bounded by this curve and the 
: T 
line 0 = —- 
2 
(b) Express area(R) as a definite integral using the integral formula for area in 
polar coordinates. 
(c) Evaluate the integral in part (b) using the trigonometric substitution 
6 
z= tan - (see page 397) and the Change of Variable Theorem for Definite 
Integrals. 


(d) Write this curve as the graph of an equation in x- and y-coordinates, and 
thence compute area(R). 
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14. 


Z 


iz 


. Find the area of the region which lies between the two loops of the limacon 


r=14+ 2cos@. 


. Find the area of the region bounded by the lemniscate r? = 2a” cos 20. 
. Find the area A of the region which lies inside the cardioid r = 2(1 + cos @) 


and outside the circle r = 3. 


. The region R bounded by the cardioid r = 4(1 + sin @) is cut into two regions 


R, and R» by the polar graph of the equation r = . Compute the areas of 


R, Ri, and Ro. aaa 


. Find the arc length of the cardioid defined by the equation r = a(1 + cos @), 


where a is an arbitrary positive constant. 
Consider the spiral defined in polar coordinates by the equation r = e7’. 
Compute the arc length of this curve from 6 = 0 to @ = In 10. 


(a) Using the integral formula for arc length in polar coordinates, compute the 
arc length of the polar graph of the equation r = 2sec@ from 6 = — a 


ine =. 
4 


(b) Identify and draw the curve in part (a), and verify from the geometry the 
value obtained for the arc length. 


a. 
Consider the curve defined by the equation r = 2 cos- 5 in polar coordinates. 


(a) Find the arc length of this curve from 6 = 0 to 6 


| 
a 


2T. 


(b) Find the arc length of this curve from 0 = 0 to @ 


CHAPTER 11 


Differentiat Equations 


1. Review. This section is primarily a review of the differential equations 
studied in Section 5 of Chapter 5 and also in Section 8 of Chapter 6. We 
begin by recalling the definition of a first-order differential equation (see 
page 272): Consider an equation F(x, y, z) = 0 in which not all the variables 
need occur, but at least z does. The equation 


F(x y, ®) = (1) 


obtained by substituting = for z, 1s a first-order differential equation. By a 
x 


solution of (1) is meant any differentiable function f for which the equation 


F(x, f(), f'(x)) = 0 


is true for every x in the domain of f. If fis a solution, we write 


y = f(x). 


The general problem, given a differential equation, is to find all its solutions. 
A more specialized problem is to find a particular solution y = f(x) which 
has a specified value 6 at some specified number a, 1.e., a solution for which 
b= F(a). 


The simplest first-order differential equations are those of the type 


d 
= = f(x), where f is some given function (not to be confused with the 
x 
solutions f discussed in the preceding paragraph). Every solution of this 


differential equation can be written 


y= [ 09s + ¢, 


for some real number c. Hence if c is left as an arbitrary undetermined con- 
stant of integration, we call {fo dx + c the general solution. 
d 
We next considered differential equations of the form ae J), 
dx &(y) 


in which fand g are given functions. Equations of this type are called sepa- 
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rable, since, if we use the fact that the derivative is equal to the ratio of two 
differentials, we can “‘separate’’ the expression containing x from that con- 
taining y by writing the equivalent differential equation 


g(y) dy = f(x) dx. 


Integrating both sides, we get the equation 


[ew dy = | 400 dx + ¢, 


which defines the general solution y implicitly as a function of x. Note that 
d 
the differential equation = f(x) discussed in the preceding paragraph 1s a 
ae 


separable equation in which g(y) = I. 
Of special interest among separable equations is the first-order linear 


d it 
differential equation a + ky = 0,in which k isaconstant. This is the type 
x 


of differential equation which describes the rate of decay of a radioactive 
substance and also the rate of growth of bacteria ina culture. It can be solved 
without difficulty as a separable differential equation (see pages 276 and 277). 
However, this equation occurs sufficiently often and has such an obvious 
general solution that most people recognize it at sight. The general solution 
is 


—kx 


y = ce 


EXAMPLE |. Find the general solution of each of the following differential 
equations: 


(a) x = tan’x sec’x, 
dy = ry 
= Se 


dy S 
Se ee 


In (b) find the particular solution y = f(x) such that f(0) = —In 2, and in 
(c) find the particular solution which has value 5 when x = 0. 
The general solution of (a) can be obtained directly by integrating: 


4 2 
y | san'x sect as + « 


5 
=#tan x +c. 
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ae | 
Equation (b) is separable, since a = e*Y = ete”, Hence we may write 
x 


eo Mey see ay: 
Integrating both sides, we obtain the equation 
—¢ enero, 


which defines y implicitly as a function of x. In this case, it is not difficult 


| 
to solve for y explicitly. We first get e” = ————.. Replacing the arbitrary 
—c—e 
constant —c by simply c, and taking logarithms, we then obtain 
l 
= In : a (= ) 


as the general solution. To find the particular solution y = f(x) for which 
f(0) = —1n2, we substitute these values in equation (2) and solve the 
resulting equation for c. Thus 


—In2 = In a 
c— 1 
Since —In 2 = In, it follows that 2 = c — 1, and so c = 3. Hence the 
particular solution required is 


i a 
oa ee 
The general solution of (c) can be written down on inspection. It is 


iy = GE 
The particular solution which has value 5 when x equals 0 is obtained by 
writing 


Hence the particular solution 1s 


—— Ags 


y = 5e 


The definition of an nth-order differential equation, m > 1, is entirely 
analogous to that of a first-order equation. Let F(x, yo, yi, .. +n) = 0 be 
an equation in n + 2 variables in which not all the variables occur, but at 
least y, does. Then the equation 


pe n 
F(x»2 oy PY) = 0, (3) 


Gul : ae dx” 
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obtained by substituting the ‘th derivative 7 
Ss 


d° 
that = = r) , 1s an nth-order differential equation. A solution is any n-times- 
x 


differentiable function f such that the equation 


SO). F OO), «sf @)) = 0 


is true for every x in the domain of f- 
Our study of higher-order differential equations has thus far been limited 
to those of the type 


d' ee 
Z for y; (where it is understood 


dy, a 


where a and 6 are constants. Such an equation is a second-order, linear, 
homogeneous differential equation with constant coefficients (see page 334). 
It is called “‘linear’’ because y and its derivatives occur to no power higher 
than the first, “homogeneous,” because the right side is zero, and ‘“‘with 
constant coefficients,’ because a and 5b are constants. 

You will recall that the form of the general solution of the differential 
equation (4) is determined by the nature of the roots of its characteristic 
equation f*? + at + b = 0. The roots of this equation are given by the qua- 
dratic formula 


=o ya — 4b, 


fA fe 
dy? 2 2 
and there are three cases depending on the discriminant a? — 4b. 


Case I. If a? — 46 is positive, then there are two distinct real roots r; 
and r2. In this case the general solution of (4) is 


er eee 


where c; and cy, are arbitrary constants. 


Case 2. If a? — 4b = 0, then r; = r, = r and the general solution of 
the differential equation (4) is 


y = (1x + erje™, 
where c; and c, are arbitrary constants. 
Case 3. If a? — 46 is negative, then r; and ry are distinct conjugate 


complex numbers, 1.e., 7; = a + iB, r2 = a — iB, and B ¥ 0. In this case 
the general solution of (4) is 


y = e(c, cos Bx + cz sin Bx), 


where c, and c, are arbitrary constants. 
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The above statements imply that, if y is any solution of the differential 
equation (4), then there exist real numbers c; and cy. such that 


y= ce" + cge"2” if a? — 4b > 0, 
y = (c1x + co)e™ if a? — 4b = 0, 
y = e**(c, cos Bx + ce sin Bx) if a? — 46 < 0. 


This fact, first stated in Section 8 of Chapter 6, has not yet been proved, 
but will be in Section 4. 

Although we have thus far not used the letter D to denote the derivative, 
this notation is quite useful in the study of differential equations. We write 
d*y 
ax? 
ventions of algebra to write (D? + aD + b)y for D?y + aDy + by. In so 
doing we have defined a function, denoted by D? + aD + b, which has the 
set of twice-differentiable functions as its domain and a set of functions as its 
range. This function assigns to each function y in its domain the function 


d’y 
dx? 


d 
Dy for - and D?y for . We then combine these symbols and the con- 
x 


(D? + aD + by = 4+ a2 4 by 


as value. Such a function is an example of a differential operator. Using it, 
d*y 
dx? 


(D? + aD + b)y = 0. ( 4’) 


d 
the differential equation +a = + by = Ocan be written 
x 


Note the similarity between the operator and the characteristic equation of 
the differential equation. The latter is the equation obtained by replacing D 
in the operator by ¢ and setting the resulting quadratic polynomial equal to 
zero. - 


EXAMPLE 2. Find the general solution of each of the following differential 
equations: 
(a) dy _ 5 6y 


dx? dx 


0, 
(b) (D'+6D+ 9)y =0, 
(c} (DP = 6) Fy — 2 


For the first, the characteristic equation is 2 — 5t + 6 = 0, which is 
equivalent to (¢ — 2)(t — 3) = 0. Hence the two roots are 2 and 3, and the 
general solution is given by 


y = ce" + ce. 


sec. 1] REVIEW 619 


In (b), the characteristic equation is ?? + 6¢ + 9 = 0, which is equiv- 
alent to (t + 3)? = 0. Thus there is only one root, —3. The solutions of the 
differential equation are therefore all functions 


y= ee + ce =, 


where c; and c, are arbitrary constants. 

The characteristic equation for (c) is f — 6¢f + 10 = 0 and, since its 
discriminant is equal to —4, the roots are not real. Solving the quadratic 
equation, we find that the roots are 3 + i and 3 — i. Hence the general 
solution is 

y = e**(¢, cos X + C2 SiN X). 


EXAMPLE 3. Find the particular solution of the differential equation 
D(D — 5)y = 0 which has value equal to 2 and derivative equal to —15 
when x = 0. The characteristic equation is r(t — 5) = 0, whose roots are 
obviously 0 and 5. The general solution is therefore 


y = cye™ + coe = C1 + ce. 


The derivative 1s 


y= Sea, 


When x = 0, we are told that y = 2 and y’ = —15. Substituting these 
values in the preceding equations, we obtain 


2=¢, + ce? = C1 + C2, 
—15 = 5ce.e°° ae 5s. 
It follows that c. = —3 and thence that c; = 5. Hence the required solution 


iS 
y = 5 — 3e°. 


It is extremely useful to recognize alternative forms of the general solu- 
tion of the differential equation (D? + aD + 6b)y = Oin the case where the 
roots of the characteristic equation are the complex numbers a + i and 
a — i8. In particular, it is easy to verify that the functions 


= ce**sin(8x + k), (3:3 
= ce**cos(8x + ky), ; (6) 
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where c and k are arbitrary real numbers, are both solutions. To see that this 
is SO, we expand (5) using the trigonometric identity for the sine of the sum of 
two numbers. The result is 


ce**(sin Bx cos k + cos Gx sin k) 


= e“"[(c sin k) cos Bx + (c cos k) sin Bx]. 


¥ 


Setting c;) = csink and c, = ccosk, we obtain y = e**(c,cos Bx + cosin Bx), 
which we know to be a solution. The proof for (6) is analogous. 

Conversely, any solution y = e**(c,cos Bx + cssin Bx) can be written 
in the forms (5) and (6). For if both c; = cy = 0, then y = 0, and we need 
only set c = 0 in (5) and (6). If c, and c. are not both zero, then 
/c + co? ¥ 0, and we can write 


ee 2 Ci Co * 
j= VAAL +. ¢,2 e*” Ss COS ; : Sr GX |, 
Ver + c? Wr ae 


To put this equation in the form of (5), we set c = \/c,2 + c.? and observe 


that, since | 
Co : Cy 2 
2 4 ( = ) = 4. 
& ec? + 5) Ver + ce 


it follows from our definition of the functions sine and cosine that there 


‘ C1 
=and sink = 


C2 
Cy ee er 


exists a real number k such that cos k = 


Hence we get 
y = ce**(sin k cos Bx + cos k sin Bx) 


= ce**sin(Gx + k). 


Again, by an analogous argument, the solution can also be written in the 
form of equation (6). 

An advantage in using the forms (5) and (6) for the general solution is 
that it is easy to see what the graphs of such functions look like. As the next 
example illustrates, they are all sinusoidal curves lying between the graphs of 
y = ce’ and y = —ce™. 


EXAMPLE 4. Find and draw the graph of the particular solution of the differ- 
ential equation (D? + 2D + x2 + 1)y = 0 which has value \/2 and deriv- 
ative equal to (m — 1)\/2 when x = 0. The characteristic equation is 
2+ 2¢-+ 7? + 1 = 0, which has roots —1 + im and —1— ir. Hence 
one form of the general solution is 


y = ce *sin(rx + k). 
Its derivative is 


— = —ce “sin(rx + k) + cre “cos(rx + k). 
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me d . 
Substituting the given values of y and 4 when x = 0 into the preceding two 
equations, we get 


V2 
ae bre 


Hence, (x — 1)\/2 = —\/2 + ercosk, from which we obtain 


c sink, 


—csink + crcos k. 


/2 = ccosk. 


Figure 1 


Since c cos k does not equal zero, it follows that 


tan ko= ae 2th ] 
ccos k «/) 


2) 


and sok = ; . This implies that c = 2, and we conclude that the particular 


solution 1s 


y= 2e “sin (xx + *) 


The graph of this equation is drawn in Figure 1. Such a curve is frequently 
called an ‘“‘exponentially damped sine wave.” 
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PROBLEMS 


1. Find the general solution of each of the following differential equations. 


2 


dy ae a dy 
(a) 5 + 2e (n) re 
aes be 2 = 
(b) a! +5x+ 1 (0) (D+ 4D + 29)y = 0 
d d ae 
c) 2 = 4+) +9 (~) W+5)—=7Ix-e 
dx dx 
dy : ay oe 
ee ae ee (q) = 
2, ay Sa dy ae 
(e} 2xy er ee (r) ee 
a mo 4 2% 
(f) | = In x (s) Ee ee 
dy _ ie oes 
(g) 28 a> It (t) vane = 49 
(h) oO + 16y = 0 (u) Gx + 4dt + 4t + 3) dx ='0 
dy ae ; 
(i) 3 + 16y = 0 (v) = cot } 
ee = lL dy 3244 
oe a 
(k) y” —19y’ — 20y = 0 (x) ° = 3 sin"x cos’x 
(1) (D’ + 10D + 16)y = 0 (y) ° = 3sin’x cos y 
(m) ae ae —20y (z) se = 65 — Ay 4 2. 
dx dx? 


2. Find the particular solution of each of the following differential equations which 
satisfies the given conditions. 


ey 2 = 72 eee 
dx 
(b) “ ioe + graph passes through the point (1, —1) with a slope of 3. 
dy 
(c) y— = —x, graph passes through the point (—3, —4). 


dx 
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(d) 


(e) 
(f) 
(g) 


(h) 


(b) 


4. (a) 
(b) 


(c) 


(b) 


(c) 


6. (a) 


(b) 


(c) 


2 
“: = —g (constant), whent = 0,7 =v, ands = 5%. 
ay 
dx 
(D° — 4D + 13)y = 0, _ graph passes through (0, 5) with a slope of 2. 


(D? — 2D — 3)y =0,  y = 7 and — = 1 when x = 0. 


= 1, y = 1n9 when x = 1. 
dx 


(D’ — 12D + 36)y =0, y= 3 and & = 7 when x = 0. 


Sketch the graph of y = e”’’cos (: + *) 

Find a second-order, linear, homogeneous differential equation with 
constant coefficients of which the function in (a) is a solution. 

Find the general solution of the differential equation (4D? + 8D + 5)y = 0. 
Find the particular solution of the differential equation in (a) whose graph 
passes through the point (0 2) with a slope of — we 

Sketch the graph of the function in (b). 

Find the general solution of the differential equation (4D? — 8D + 5)y = 0. 
Find the particular solution of the differential equation in (a) whose graph 
passes through the point (0 *) with a slope of — 

Sketch the graph of the function in (b). 

Find the general solution of the differential equation (D? + 4)y = 0. 
Find the particular solution of the differential equation in (a) whose graph 


passes through the point (0 *) with a slope of — = ; 
Sketch the graph of the function in (b). 


7. Find the general solution of the differential equation (D? — 2aD + a + 1)y = 0, 
and sketch the graph for 


ma > 0 (b) a= 0 (Se =D: 
[See equations (5) and (6).] 


2. First-Order Linear Differential Equations. A differential equation which 
can be written in the form 


ax(x) & + ay = 5) 


where 4d, do, and b are given functions of x and where ay is not the zero func- 
tion, is a first-order linear differential equation. In this section we shall show 
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how to obtain the general solution of equations of this type. Since a; is by 
assumption not the zero function, we can divide both sides of the above equa- 


do(X) ==: POocand 2 = Q(x), we therefore obtain 
a(x) ay(x) 


the differential equation 


tion by a;(x). Setting 


© + POxy = Q(x), ci) 


which is the form we shall use in deriving the solution. We shall assume that 
the functions P and Q are continuous, thus assuring ourselves that they have 
antiderivatives. 

Let us suppose that the function y = f(x) is a solution to the differential 
equation (1). We shall derive a formula which expresses y in terms of P and 
Q and a constant c of integration. Conversely, it will be a simple matter to 
verify that any function y defined by this formula is a solution to (1). Hence 
the formula gives the general solution to the differential equation. 

The derivative of the product of y and a function ¢ is given by 


f (oe) = e) 2 + e'woy. (2) 


d 
Note that the first term on the right has - as a factor and the second has yp 
x 


as a factor, and that the same is true of the two terms on the left side of equa- 
tion (1). This fact suggests seeking a function ¢ which has the property that, 
if both sides of (1) are multiplied by g(x), then the left side of the resulting 
equation is the derivative of the product o(x)y. If both sides of (1) are multi- 
plied by an arbitrary ¢(x), the result is 


(x) & + ePOY = (x) 000. (3) 


Comparison of this equation with (2) shows that its left side is equal to 


d 
rs (o(x)y) provided o(x)P(x)y = ¢’(x)y, which will in turn be true provided 


g(x)P(x) = ¢’(x). (4) 


However, it is easy to find a function ¢ which satisfies (4), since, as a 
differential equation with ¢ the unknown function, it is separable. Solving it, 


we obtain 
g(x) _ 
HK Pex). 
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Whence 


| ae dx = | Pix rax, 


In|g(x)| = [Po dx + ¢, 


which implies 


and so 
lo(x)| —_ ol Playdape 


Since we are only seeking a solution to (4), and not the most general form of 
the solution, we may assume that ¢(x) is positive and also ignore the constant 
of integration. We conclude that if 


d 
then the left side of equation (3) is equal to cs (¢(x)y). 
x 


With (5), equation (3) therefore becomes 


d 
= (e(x)y) = ¢(x)Q(). 
Integration yields 


e(x)y = | o(x)Q(x) dx + ¢, 


and so 


ae 5 || oro dx + |. 


f P(x)dx 


for some real number c. Replacing ¢(x) by e , we obtain the promised 


formula: 


(2.1) y= oI Playde | [eto dx Ae e| 


Suppose next that c is an arbitrary constant and that the function y is 
defined by (2.1). Then 


yes Penrde oc [fro dx ee C. 
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The derivative of the right side of this equation is e/ ? “* Q(x) and that of the 
left side is 


dy [P(xdzx SP x)dzx 
= + yP(x)e 


Hence 


ol Playaz FE: 28 Poo ee) el P@@Z Ox), 


dx 


d 
which implies at once that - + P(x)y = Q(x); 1.e., y is a solution to (1). 
x 


We conclude that formula (2.1) gives the general solution to the differential 
equation (1). 

We strongly recommend that no one memorize (2.1). The important 
fact to remember is that, if the first-order linear differential equation 


d 
- + P(x)y = Q(x) is multiplied through by g(x) = e/?©*, then the left 
x 


side of the resulting equation is equal to the derivative of the product ¢(x)y. 
Consequently, the new equation can be integrated to give 


A(x)y = | g(x)Q(x) dx + ¢, 


which can then be solved for y. This function ¢(x) = e/?*, which enables 
us to change a seemingly nonintegrable sum into the derivative of a product 
by multiplication, is called an integrating factor. 


EXAMPLE |. Solve the differential equation 


. ody ae i 
x. 3Xy — 24 =e 
To put this in the form of (1), we add 2x? to both sides and then divide by x’. 


The result is 


iy Se 
mee oe (6) 
3 cee 
where P(x) = — — and Q(x) = 4x? + 2. An antiderivative of P(x) 1s given 


by - 
[ Po eS | — dx SES tale) Si as 


3 


and it follows that the function 


be 4 
———= > 


S P(e) dx Zs olniz 


g(x) =e = |x 
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is an integrating factor. Equation (4) shows that if ¢(x) is an integrating 
factor, then so also is —¢(x). Hence we may drop the absolute values and 
write simply 
ee, aie 
Multiplying both sides of (6) by x” *, we obtain 
ad 


_ ty =i —3 
Fai ee ai ee oe 


iets d 
It is easy to see that the left side of this equation is equal to - (x *y). Hence 
x 


2 ie Oe os ne 
an by itd 2x, 
and so 


x eee far 4+ 2x) dx + ¢ 


») 


—Z 


4 1n|x| + ets de C 


4 In|x| a? os + C. 
X* 
where c is an arbitrary constant. It follows that 
pS Se ee 


is the general solution. 


EXAMPLE 2. Find the general solution of the differential equation 
dy ae 
ae ay = 2 sux. 


Note that this is a first-order linear differential equation with constant co- 
efficients, but that it is not homogeneous, because the right side is not the 
zero function. In this example we have P(x) = 3 and Q(x) = 2 sin x. Hence 


[Poo due ps Peis 3% 


and an integrating factor is 
Pte hs 32 
¢(x) = _ (x)dx __ e i 


It follows that 


d 3x a5 Bes 
oe Cee ame 2 
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and so 
ey = [resin xdx +c. ee 


To evaluate {2e* sin xdx = 2fe%* sin x dx, we use integration by parts 


twice: 
5 pee 8x 
fe sinxdx = os dcos x 


82x 382 
= ie Cosme [cos x de : 


34 3 
—e*cosx + 3 f cos x dx 


3 e 
[er asinx 


2 . : 3 
oO Se J sin x ae 


8x 
|e cos x dx 


3U _° 3 : 
e sin x — af ” Sta Rex: 
Combining these results, we get 


3 : 3. Sis 3 - 
[eosin x ax = —e”cosx + 3e” sinx — 9 | sin x dx 


whence 
Sy, Baie 3x ° 
10 [¢ “sin x dx = é°G sin x — ees 3), 
and so 
e°" 
” 4 . e 
2+e° sya = = GB sin x — cos x). 


Returning to (7), we have 


3x 


3x 


eye “= G sin x — cos x) + ¢, 


and consequently the general solution of the differential equation Is given by 
y = ¢(3 sin x — cos x) + ce *, (8) 


where c is an arbitrary constant. 


Note that the above solution (8) of the differential equation of Example 
2 is the sum of two terms. The second, which is ce *", is the general solution 
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eo 
of the homogeneous differential equation = + 3y = 0. The first term, 
x 
1(3 sin x — cos x), is one particular solution of the nonhomogeneous 
differential equation oa + 3y = 2sin x. As we shall see, this situation 1s 
x 


typical of the solutions of linear differential equations. 


PROBLEMS 


1. Find the general solution of each of the following differential equations. 


2 
(a) ee a ag (h) Pra sh Ry EF 
es ax 
a ees Gp y= Fees x 
dx dx 
d a ee iy 
(c) ea Sy 2 (j) a : 
ax dx x 
(d) Bes 5; ie 4 (k) fee ax + bx +e 
dx dx 
dy 2x I D+9)\=T 
Be apie () D+% 
x 
9 d 9 d 3 e” 
(f) 6x y =e Be ie, (m) a -+- = ee 
dx dx x x3 
dy 9 dy COS x 
fe) ycos x + — = cosx (n) x — + Sxy = — 
dx dx x3 


Nh 


(a) Find the general solution y, of the homogeneous differential equation 


“ ae 2 0 
(b) Show that the general solution of the nonhomogeneous equation 
° 4+ 2xy = 3xe~*” is equal to the solution », in part (a) plus a particular 
solution to the nonhomogeneous equation. 


3. This problem is the general version of the preceding one. Let P and QO be 
continuous functions of x. 


(a) Find the general solution y, of the homogeneous differential equation 
— + Py = 0. 


(b) Show that the general solution of the nonhomogeneous equation 


dy ; 
" + Py = Qis equal to the solution y, in part (a) plus a particular solu- 
dx 


tion to the nonhomogeneous equation. 
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3. Linear Differential Operators. This section is divided into three parts. 
In the first, we shall systematically develop and extend the differential 
operators D? + aD + 6 which were introduced in Section 1. In the second 
part we shall use these operators to obtain directly the general solutions of 
certain linear differential equations with constant coefficients. Finally, we 
Shall show how these methods can be used to solve any linear differential 
equation with constant coefficients (whether homogeneous or not) provided 
we extend our range of functions to include those whose values may be com- 
plex numbers. 

By a linear operator we shall mean any function L whose domain and 
range are sets of numerical-valued functions and which satisfies the equations 


Lyi + yx) = LQ) + LO»), Cr) 
Liky) = kL@Q), ez, 


for every real number k& and every y, y1, and y. in the domain of L. [The 
function L(y) is frequently written simply Ly.] An important example is the 
function D, which, to every differentiable function y, assigns its derivative 


d ; es ot 
Dy = << Another example is the operation of multiplication by a real 
b 


number. That is, for any real number a, the function L defined by 
Ly = ay 


obviously satisfies (1) and (2) and hence is a linear operator. 
If L, and Ly are linear operators, then their sum is the function ZL; + L, 
defined by 
(Li + Loy = Liy + Loy, (3) 


for every y which is in the domains of both L; and Ly. It is easy to show that 


(3.1) Jf L, and Ly» are linear operators, then the sum L, + Ly» is also a linear 
operator. 


Proof. We shall show that L; + Ly satisfies equation (1) by using successively the 
definition of L; + Ls, the linearity of L; and L» separately, the commutative 
law of addition for functions, and finally the definition again. Thus 


(Li + Lo)Q1 + y2) = LiQ + ye) + LeQi + ye) 
= Diy bite Dae 2 
= (ays PF ei es Lay») 
= (L; + Lo)yy + (Li + Lp) ye. 
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The proof that L; + Ly satisfies (2) is similar: 

(Li; + La)(ky) = Li(ky) + Lolky) 
kLiy + kLoy 
k(Liy + Ley) 
k(L, > Lo)y, 


and this completes the proof. 


If L, and Ly are linear operators, then the composition of L. followed by 
L, is the function denoted by L,L» and defined by 


(L,L2)y = Ly(Loy), (4) 


for every y for which the right side is defined. The proof of the following 
proposition is entirely analogous to that of (3.1) and is left to the reader as an 
exercise. 


(3.2) If L, and L, are linear operators, then the composition L,Lz» ts also a 
linear operator. 


The composition L,Ly is also called the product of L; and L». There is no 
reason to suppose from the definition that the commutative law of multiplica- 
tion holds, and, for linear operators in general, LiL, # L.L,. However, the 
distributive laws hold: 


LL, + LiL, 
LL; = LL 3. 


L,(L, + Ls) 


(2.3) 
(Li + Le)Ls 


Proof. The first of these is proved as follows: 


(Li(L2 i Ls) )y Li((Le - L3)y) 
= [,(Loy + Ly) 
= [(Ley) + Lillsy) 
= (LyL2)y + (LiLs)y 
=" (LyL5 + Lils)y. 


The proof of the second is similar and is left as an exercise. 


An important example of the product of linear operators is the composi- 
tion of a linear operator L followed by the operation of multiplication by a 
real number a. This product, denoted aL, assigns to every y in the domain of 
L the value (aL)y which is equal to the product of a with the function Ly. 
Ehat is; 


(3.4) (aL)y = a(Ly). 
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The composition in the other order is the product La. Here we have 
(La)y = L(ay), and the latter quantity, by the linearity of L, is equal to a(Ly). 
Combining this with (3.4), we obtain the equation (La)y = (aL)y. Thus the 
operators La and aL are equal, and we have proved the following special case 
of the commutative law: 


CS.5) aL = La. 


Another example of the product, already encountered, is the operator 
D°, which is the composition D? = DD of D with itself. More generally, 
for every integer n > 1, we define the operator D” inductively by 


Di s5 BDA. 


The domain of D” is the set of all n-times differentiable functions, and, for 
each such function y, we have 


By repeated applications of (3.1) and (3.2), we may conclude that any 
function formed in a finite number of steps by taking sums and products of 
linear operators is itself a linear operator. As an example, consider a poly- 
nomial p(t) of degree n; i.e., 


PO) = Gat” + aQnit”™ | +--+ + ait + a, 
where do, ..., G, are real numbers and a, + 0. Then the function 
p(D) = eB ea 0 = se eee 


is a linear operator. To every n-times differentiable function y, it assigns as 
value the function 


p(D)y = 42! 7 + 3D" y+ = a 


d”y ay dy 
eh ++ em Waren eae + aoe Se 2 oes = doy. 
dx” dxn-1 dx 


We call p(D) a linear differential operator of order n. It is the natural general- 
ization of the differential operators of order 2, of the form D? + aD + b, 
which were discussed in Section 1. [Linear differential operators of types 
more general than p(D) certainly exist; e.g., see Problem 9. They are of 
importance in more advanced treatments of differential equations, but we 
shall not study them here.] 

The polynomial differential operators p(D) can be added and multiplied 
just like ordinary polynomials. In particular, the following theorem follows 
from the distributive laws (3.3) and the commutative law (3.5): 
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(3.6) If p(t) and q(t) are polynomials and if p(t)q(t) = r(t), then 
p(D)q(D) = r(D). 


As an illustration, observe how (3.3) and (3.5) are used to prove the 
special case of this theorem in which p(t) = at + 6b and q(t) = ct + d. 
First of all, we have 


r(t) = p()q(t) = (at + b)et + d) 


= act? + bct + adt + bd. 


Then 

p(D)q(D) = (aD + b\(cD + d) 
(aD + b)cD + (aD + b)d 
aDcD + bcD + aDd + bd 
acD? + bcD + adD + bd : 
= 7D). 


| 


The proof is the same in principle for arbitrary polynomials p(t) and q(t). 
It is a corollary of (3.6) that polynomial differential operators satisfy 
the commutative law of multiplication. Thus 


C27 ) p(D)q(D) = g(D)p(P). 


For, since p(g(t) = q(t)p(t) = r(0), both sides of (3.7) are equal to r(D). 


We begin the second part of the section by considering the differential 
equation 


which, with the notation of differential operators, can be written 
(D? — 2D — 3)jyy =e”. E53 


We have thus far defined the characteristic equation only for homogeneous, 
second-order, linear differential equations with constant coefficients. The 
generalization to nonhomogeneous and higher-order equations is: For any 
polynomial p(t) and function F(x), the characteristic equation of the differ- 
ential equation 

p(D)y = F(x) 


is the equation p(t) = 0, and the polynomial p() is its characteristic poly- 
nomial. 
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—— 


Returning to (5), we see that the characteristic polynomial, which is 
t? — 2t — 3, factors into the product (¢t — 3)(¢ + 1). It follows from (3.6) 
that D? — 2D — 3 = (D — 3)(D + 1), and (5) can therefore be written 
(D — 3)((D+ Dy =e”. 


Let us define the function wu by setting (D + 1)y = u. Then (5) becomes 
equivalent to the pair of first-order linear equations 


(D.— 37 =e, (6) 
(D + ly =u. (7) 
To solve (6), we use the technique developed in Section 2. For this equation, 
P(x) = —3and Q(x) = e~*. Hence an integrating factor is ef? 4 = e—*, 
and therefore 
ey seer, —3xr —2x Pl 


Integrating, we obtain 


—-s —4 —Azxr 
ee = fe “dx +c, = —fe “+ 4, 
whence 


u = e"(—fe "+ c1) = —Ge"* 4+ ce. 


We now substitute this value for uw in equation (7) to obtain the first-order 
linear equation 
(D+ l)y = —fe* + cye**. 


Here, P(x) = 1 and the integrating factor is e*. Accordingly, we have 


: pe x —x x 
a eb) Se ae) 


= ri +- Cie": 


Integration yields 


; Ci 
ey = —4x+ = et ee, 


Replacing fi by cy, and multiplying both sides by e~*, we get finally 
nS —ixe * ++ cet + coe. 


This, where c; and c, are arbitrary real constants, is the general solution to 
the differential equation 
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This example illustrates the fact that we can in principle solve any second- 
order, linear differential equation with constant coefficients provided the 
characteristic polynomial is the product of linear factors. Thus, if we are 
given 


(D? + aD + b)y = F(x), 


and if 2 + at+ 6 = (t — ri)(t — rz), then the differential equation can be 
written 
(D — r,)(D — ry = F(x). 


If uw is defined by setting (D — r2.)y = u, then the original second-order 
equation is equivalent to the two first-order linear differential equations 


(D PS ry)u as Fix), 
(D — r)y = U, 


and these can be solved successively to find first uv and then y. 
The same technique can be applied to higher-order equations. Consider 
an arbitrary polynomial 


We a eae oh a, 


where n > 1 and a,..., @,_1 are real constants. In addition, we assume 
that p(t) is the product of linear factors; 1.e., 


DO =F ge yy Ct — Pa). 


Let F(x) be given and consider the differential equation 


p(D)y = F(x), (8) 
which is the same as 


r sg 


d"y dy 
a ee dane re =e Ay ae + ay = F(x). 


dx” 


Since the factorization of p(7) is assumed, the differential equation can also 
be written 
(D ery) ri)(D ne ry) me Re (D Tener te F(x), 


The functions wm, ..., U,—1 are defined by 
f= (D ae ry) ee (D a Ayi 
Uy = (D =F rs) 7 (D ar Ir)Vs 


ee ee (D ns a sg 
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Then (8) is equivalent to the following set of first-order linear differential 


equations 
(D i ry)uy ae B(x); 


(D = Hie = &, 


(D setae a = Ly Als 


which can be solved successively to finally obtain y. 


In Section 4 of Chapter 7 use was made of the fact that any polynomial 
with real coefficients and degree at least 1 can be written as the product of 
linear and irreducible quadratic factors (see page 386). Suppose cf? + dt + e 
is irreducible. This is equivalent to the assertion that the discriminant 
d? — 4ce is negative. According to the quadratic formula, the two roots of 
the equation cf? + dt + e = O are equal tor; = a+ i6 and r, = a — if, 
where a = — se and 6 = —= ai 

ze ae 
these values, one can then easily verify the equation 


c(t == rit = rz) = ¢F aa dt a oe 


By multiplying and substituting 


Thus any irreducible quadratic polynomial with real coefficients is the product 
of two linear factors with complex coefficients. It follows that, for any 
polynomial 


P(t) a TS i ae CE Ss = ese <= ayl + Qo, 
with real coefficients a; and withn > 1 and a, # 0, we have 


P(t) gee! an a ri)(t = ry) rene (t ae ha 


where roots which are complex occur in conjugate pairs. 

It is this fact which introduces the third part of this section. It is very 
natural to ask the following: If the class of possible solutions is enlarged to 
include complex-valued functions of a real variable, can we proceed to solve 
linear differential equations with constant coefficients just as before, but with 
the added knowledge that now the characteristic polynomial can always be 
factored into linear factors? The answer is yes! 

To justify this answer, we must of course know the definition of the 
derivative. Let fbe a function whose domain Q is a subset of the real numbers 
and whose range is a subset of the complex numbers. Then two real-valued 
functions f; and fs with domain Q are defined by 


tix) = real part-or( 4), 
fo(x) = imaginary part of f(x). 
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That is, we have f(x) = fi(x) + ifo(x), for every x in Q. The derivative f’ is 
defined simply by the equation 


F(x) = fix) + ife(), 
for every x for which both f/(x) and fy(x) exist. Alternatively, if we write 


y = f(x), u = fi(x), and v = f(x), then y = u + iv, and we also use the 


notations 
dy du , .dv 


me Ox 


= Py: = Due 4-1 De. 


f(x) 


Logically, we must now go back and check that all the formal rules for 
differentiation and finding antiderivatives are still true for complex-valued 
functions, and the same applies to several theorems (see, for example, 
Problems 10 and 11). Much of this work is purely routine, and, to avoid an 
interruption of our study of differential equations, we shall omit it. 

It now follows, by factoring the operator p(D) into linear factors, that 
any linear differential equation 

p(D)y = F(x) 


with constant coefficients can be solved. That is, it can first be replaced by an 
equivalent set of first-order linear differential equations. For each of these 
an explicit integrating factor e/?“ exists, and by solving them successively, 
we can eventually obtain the general solution y. 


EXAMPLE 1. Solve the differential equation (D? + I)y = 2x. Since 
?e+1= (t+ )(t — 1), we have 


(D + i)(D — iy = 2x. 
Let (D — i)y = u, and consider the first-order equation 
(D + i)u = 2x. 


Since P(x) = i, an integrating factor is e’”, and we obtain 
< (e'"u) = e'*2x, 
from which it follows by integrating that 
eu = 2 | ae Oe ce 


By integration by parts it can be verified that 


AL 


[Pe (9) 


Qa 
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j ' I : 
In this case, a = ij and we know that — = —/and that 72 = —1. Hence 
i 


eg, =. = Dive’ 26% ke ci, 
and so 
u= —2ix+2+ ce. 


It therefore remains to solve the differential equation 


(D — iy = —2ix + 2+ ce. 


This time, an integrating factor is e**. Hence 


£ ey) — 2x 4 267 4b ce. 
Integration [with a second application of (9)] yields 
eS wae Ree o. = Pains ef . 


Replacing the constant — S by simply c,, and multiplying both sides by e’”, 
we obtain . 
= 2x a eet a pe 


If the function y is real-valued, then it is easy to prove that c; and c,2 are 
complex conjugates [see (4.3), page 644]. In this case cie~’” + coe” may be 
replaced by c, cos x + c, sin x, where now the constants c; and c. denote 
arbitrary real numbers. We conclude that 


y = 2x + c,cosx + ce. sin x 


is the general real-valued solution to the original differential equation 


The computations in this section were long and involved. The important 
fact we have shown is that the equations can be solved by an iteration of 
routine steps. As a practical matter, however, it is clear that some general 
computationally simple techniques are badly needed. These will be developed 
in the next two sections by breaking the problem into a homogeneous part 
and a nonhomogeneous part and attacking each one separately. 


PROBLEMS 
1. Find the general solution of each of the following differential equations. 
dy , dy —2 dy t 
(a) Page Fei ae. (d) 5 ak aad 


(b) (D+ 2)(D — 1)y = 6” (ce) (D’ + Ly = x2 +1. 
(c}-(DS 3D + Dy = 4x +3 
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8 


10. 


a. 


Using equations (1) and (2), prove that, if Z is a linear operator, then 


L0(1 — yo) = L(y) — LO»). 


. Show that equations (1) and (2) can be replaced by a single equation. That is, 


prove that a function L is a linear operator if and only if 


L(ay; + bye) = aly; + bLy>2. 


. Prove (3.2); i.e., if L; and L»2 are linear operators, then the composition LL» is 


also a linear operator. 


. Prove the second equation in (3.3), i.e., the distributive law (ZL; + Ly)L3 = 


Enh; + LeL3. 


. It might at first seem more natural to define the product of two linear operators 


L, and Ly, by the equation 
(L;L2)y = (Liy)(L2y). 


(This is the way the product of two real-valued functions is defined.) Using this 
definition, show that, if D is the derivative, then D? is not a linear operator. 


. Let f(x) be a given function and L a linear operator. Define f(x)L by the 


equation 


(FQOL)y = FLY). 


Show that f(x)L satisfies equations (1) and (2) and hence is a linear operator. 


. (a) Show that the operation of multiplication by a given function f(x) is a 


linear operator. That is, prove that, if M is defined by 
My = f(x), 
then M is a linear operator. 


(b) Show that the composition of a linear operator L followed by the operation 
of multiplication by f(x) is just the operator f(x)L defined in Problem 7. 


. (See Problems 7 and 8.) If f(x) is a differentiable function and if D is the 


derivative, then both linear operators f(x)D and Df(x) are examples of linear 
differential operators more general than the type discussed in the text. Show 


that 
xD 3 Dx, 


by applying both sides to the function y = x. Thus the commutative law of 
multiplication fails. 


Let fand g be differentiable complex-valued functions of a real variable. Show 
that the ordinary product rule for differentiation is still valid; i.e., prove that 


d d d 

oe (f(x)g(x)) = (4 foo) BUX} "f(x) (4 9) 
[Hint: Let f(x) = Ai) + ifh(x) and g(x) = gi(x) + ige(x), and apply the 
definitions of the derivative and of multiplication of complex numbers.] 


(a) Let fbe a complex-valued function of a real variable which is differentiable 
at every point x of an interval J. Show that if f’(x) = 0, for every x in J, 
then f(x) is a constant on /. 
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(b) Let f and g be two complex-valued functions of a real variable with 
f'(x) = g’(«) at every x of some interval J. Show that there exists a com- 
plex number c such that f(x) = g(x) + c, for every x in J. 


12. Find the general solution of each of the following differential equations. 


(a) (D—1)(D + 2y = 0 (b) ——~——- —4— 4 dy = 


4. Homogeneous Differential Equations. For a given function F(x) and a 
given polynomial 


pa = @” + a,_1t" '+---+ at+ ao, 
let us consider the differential equation 
p(D)y = F(x). it) 


The simplification of the theory gained by enlarging the set of possible solu- 
tions to include complex-valued functions of a real variable was demonstrated 
in Section 3, and we shall continue to use this technique. Nevertheless, our 
primary concern is still that of finding real-valued solutions to real differential 
equations. For this reason, we shall assume throughout that the coefficients 
do, . . . 5 An—1 Of the polynomial p(t) are real numbers and that F(x) is a real- 
valued function. Associated with the differential equation (1) is the homo- 
geneous differential equation 


p(D)y = 0, (2) 


called the associated homogeneous equation of p(D)y = F(x). A theorem of 
basic importance is the following: 


(4.1) THEOREM. Jf yo is any particular solution of (1) and if y is the general 
solution of (2), then y + yo is the general solution of (1). 


Proof. Once the statement of this theorem is understood, its proof becomes almost 
a triviality. First, one should realize that, strictly speaking, the general solu- 
tion of a differential equation is the set of all its solutions. Referring to a 
function y as the general solution is actually a common and very convenient 
misuse of language. What it really means is that y depends not only on x, but 
also on one or more other variables which are arbitrary constants of integra- 
tion and can take on any real, or complex, values. That is, we have a function 
y(x, Wy, ..., Un), and, for every set of real (or complex) numbers c1,... , Cn, 
the function y defined by 


Y 3 PO) 


is a Solution of the differential equation. Conversely, corresponding to every 
solution f(x), there exist numbers cy, ..., C, such that f(x) = o(x,¢1,..., Cn). 
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Thus y, as expressed in the above equation, does exhibit the set of all solu- 
tions. 

With this understanding, it follows that (4.1) is equivalent to the follow- 
ing proposition. Let yo be an arbitrary solution of equation (1). Then: 

(i) If y; is any solution of (1), then there exists a solution y2 of (2) such 
that y, = ye + yo. 

(ii) If yo is any solution of (2), then yz + yois a solution of ()). 

The proofs use only the fact that p(D) is a linear operator. To prove 
(i), we set yo = y1 — yo and check that y. is a solution of (2). We get 


P(D)y2 = p(D)O1 — Yo) = P(D)y1 — p(P)y0 
= F(x) — F(x) = 0. 


For (ii), we need only verify that y2 + yo is a solution of (1). We have 
P(D)(v2 + Yo) = P(D)y2 + p(D)yo 
=0+ FQ) =F), - 
and the proof of (4.1) is complete. 


As a result of this theorem, our approach to the problem of solving the 
differential equation p(D)y = F(x) will be divided into two parts. We shall 
first concentrate on finding the general solution of the associated homo- 
geneous equation p(D)y = 0, and then consider methods of finding a par- 
ticular solution to the original nonhomogeneous equation. The remainder 
of this section will be devoted to the first part. 

We begin with the second-order linear homogeneous differential equa- 
tion with constant coefficients: 


aed b)y =, (3) 


The general solution of this equation has been presented earlier (see page 617), 
but without proof. We shall supply the proof now by factoring the linear 
operator D? + aD + 6 and solving the equation by the iterative technique 
of Section 3. The characteristic polynomial can be written as the product 


eP+tatt+b= (t — ri)(t — Pro), 


where the roots 7; and r, are either both real or distinct conjugate complex 
numbers. Equation (3) can therefore be written 


(D — r:)(D — r)y = 0. (4) 
(4.2) THEOREM. The general solution of the differential equation (3) [or 
equivalently, of (4)] 1s: 
(i) y = cye"1* + coe"2”, if ry A Pre, Or 
Gi) y= (ax + ce”, ifr = re = 7, 


where c, and cz are arbitrary complex numbers. 
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Note that these solutions include all the real-valued ones, since the set of 
all real numbers is a subset of the set of all complex numbers. 


Proof. Let y be an arbitrary solution of (4). We define the function u by setting 
u = (D — ro)y. Then (4) is equivalent to the two first-order linear equations: 


(D — ri)u = 0, 

(D — re)y = u. 
An integrating factor for the first of these is e~"1*, because, in the notation of 
first-order linear equations, we have P(x) = —r,. It follows that 


| ee 
— = Q, 
x (ew) 


Integration yields e~"17u = cy, whence 
ee or oS ar for some complex number c¢. 


Substituting the expression for u into the second differential equation above, 
we obtain 
(D — ny = cre’. 


This time an integrating factor is e~"2*, and so 


d —Tox Ty—To)z 
ae (5) 


We now distinguish two cases. 


Case 1. r; ¥ ro. Integration of (5) yields 


1 xz 

—Tot (r —7 ) 

e . y c1e : . C25 
ry ro 


for some complex number c2. Multiplying both sides by e"2* and replacing 
Ci 


by simply ci, we get 
a2 


y = cye1* + coe"2", 


Case 2... ry = re = r. Then e1~72)* = @ =. 1, and.(5) reduces to 


d =I 

—(e ae ine 

rs ( y) 1 

Integrating, we obtain e~"*y = cyx + C2, for some complex number c2, and 
it follows that 


y = (cx + ene”. 
We have now proved that, if y is an arbitrary solution of the original 


differential equation (4), then there exist complex numbers c; and cy (either 
or both of which may perfectly well be real—every real number is a special 


SEC. 4] HOMOGENEOUS DIFFERENTIAL EQUATIONS 643 


case of a complex number) such that y is of form (i) if 71 ¥ re and of form (ii) 
ifr; = ro = r. Conversely, it is a simple matter to check by substitution that, 
for any complex numbers c, and ce, the function cye"1* + c2e"2* is a solution 
if r, ¥ ro and the function (cyx + ce” is a solution if r; = re = r. This 
completes the proof of the theorem. 


How can we use Theorem (4.2) to obtain the general real-valued solution 
of the differential equation (D? + aD + b)y = (D — 14)(D — ~m)y = 0? 
Suppose, to begin with, that r; and r, are both real and that r; ¥ rz. It follows 
from part (i) of Theorem (4.2) that the function defined by 


= cye"1” + c,e"2” for any two real numbers c, and ¢o, 6 
b] 


is a solution, and it is certainly real-valued. There is only one obstacle in the 
way of the conclusion that (6) is the general real-valued solution. This is the 
a priori possibility that there might exist complex numbers c; and cs, which 
are not both real, but are such that c,e"1* + c,e"1” is a real-valued function. 
This, in fact, cannot happen, as the following argument shows: Let 
Cy = ¥1 + i6,; and cz = Yo + ibs. Since 


(¥; + ide + (7, + id.)e"? 
is by assumption real-valued, then so is 
(¥1 + i8:)e717 + (Vo + i6y)e"2* — VYye"1* — Yoe72* = i(6e"" + 62€"2"). 
Hence 
6;e"17 + §,e727 = 0, 
and so 


sets 72)” ———— 


This equation holds for all real values of x. But, since 71; — r2 ¥ 0, the left 
side has constant value only if 6, = 0, which in turn immediately implies 
that 6. = 0. Hence 6, = 6, = 0, and the argument is complete. With this 
problem disposed of, it now follows from (4.2)(i) that, if 7; and 72 are real 
and unequal, then the general real-valued solution of the differential equation 
is given by (6). 

A similar situation arises if r; = ro = r. In this case r must be a real 
number, and it is a corollary of part (ii) of Theorem (4.2) that the function 
defined by 


y = (c1xx + ce, for any two real numbers c, and ¢2, iJ) 


is a solution, and, of course, it is real-valued. Again, we must show that it is 
not possible to have complex numbers c,; and C2, not both real, such that 
(cxx + c,)e’* is a real-valued function. The proof of this fact is similar to that 
of the analogous result in the preceding paragraph, and we leave it as an 
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exercise. It then follows from (4.2)(i1) that the general real-valued solution is 
given by (7). 

The third and final possibility is that the roots r,; and r2 of the character- 
istic polynomial are distinct conjugate complex numbers. In this case, we 
need the lemma: 


(4.3) Ifr: = a+ iB, ro = a — iB, and B ¥ O, then the function defined by 
y = ce” + c,e"2”, ~~ for arbitrary complex numbers c, and Co, 


is real-valued if and only if c, and cz are complex conjugates. Moreover, if 
Cy = Y + 16 and cz = Y — 16, then 


y = e*(27 cos Bx — 26 sin Bx). 


A proof in the “‘if?’ direction is given in detail in (8.3) on page 347. In 
addition, the above equation giving y in terms of a, 6, Y, and 6 is also derived 
there. The “‘only if’? direction can be proved in the same direct manner as 
the analogous results for the other two cases: Let cy = Y; + 16, and cy = 
Y2 + 162, substitute these values into c,e"1” + c,e"2”, and impose the condition 
that y is real-valued. It will then follow that ¥; = Y. and that 6; = —6,. 
Again, we leave this task as an exercise. 

Let us replace the real constants 2Y and —26 which appear in the 
equation in the last line of (4.3) by c; and cz, respectively. It is then a 
corollary of (4.3) and (4.2)(i) that the general real-valued solution of the 
differential equation (D? + aD + b)y = (D — 71)(D — ry = 0 is 


y = e**(c, cos Bx + c2 sin Bx), for any two real numbers c; andc,, (8) 
provided r; = a+ iB, ro = a — i@, and B # 0. 

This completes the proof that second-order, homogeneous, linear differ- 
ential equations with real constant coefficients have the general solutions 
first described in Section 8 of Chapter 6 and again in Section 1 of this chapter. 

The higher-order homogeneous equations can be solved in the same way. 
If 


PWD = 8? + G,ial" 7 ey ee ee 


then the general solution of the differential equation p(D)y = 0 can be ob- 
tained by first factoring p(D) to obtain an equivalent set of n first-order 
linear differential equations which are then solved successively to find y. 
As an illustration, we shall solve a third-order equation by this method. 
Following this example, we shall give (without proof) the form of the general 
real-valued solution for arbitrary order n. 
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EXAMPLE |. Find the general solution of the differential equation 


The characteristic polynomial is p(t) = t? — 372+ 4. Substituting —1 for 
t, we obtain p(—1) = 0, from which it follows that (¢ + 1) is a factor of 
p(t). Dividing, we find that 


e-32+4=(t4+ Ie -— 44+ 4 = ¢4+ DE — 2). 
Hence the differential equation can be written 
(D + 1)(D — 2)*y = 0. 


We set uw, = (D — 2)*y and uw, = (D — 2)y and, by so doing, obtain the 
equivalent set of three first-order equations 


(D+ Du = 0, : 
(D — 2)u. = , 
(D — 2)y = wu. 


The general solution of the first of these is uw. = ce”, and the second equa- 
tion is therefore 
(D 5s 2)u2 = emu: 


An integrating factor is e *”, and so 


d —227r ashe —x —3xr 
a ES, uo) = @€ C1ée = ¢(,e ‘ 
ot ) 
Hence 
8 Cy 258 
ec ee * + Co, 


C 
from which it follows by multiplying both sides by e?” and replacing — = 
by simply c, that 

tie = Ce” + *cne™. 


The third equation is now seen to be 
(D — 2)y = cye* + cre. 


Again, e * is an integrating factor, and we have 

d —2x _ ~—22 —wx 2 3. - —3x 

ax © Vee tee a ae) = ie’ ee. 
Integration yields 


i 5 
e Ware toigule * 4 Cox + C3. 
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Multiplying both sides by e?* and replacing — = by simply c; again, we have 


y= ce ~ + (cox + cs)e, 


where ci, C2, and c; are arbitrary real constants. This is the general real- 
valued solution and completes the example. 


We now give the general solution for arbitrary order n. Let 


PO = 0 ee ee 


and suppose that factorization into real-valued irreducible factors yields the 
product 


PD = @ — 1 6 = YE ea Fay a tetas 


where 7,...,m, and ,,...,, are positive integers, the factors t — r; 
are all distinct, and the factors 7? + c;t + d; are all distinct. For each factor 
(t — r,)’"i, define the function 


fix) = Cae fl Ces 


7) 


for arbitrary real numbers Ca, ..., Cim;- 


For each factor (#7 + c,t + d;)"i, leta; + i8; and a; — i8; be the roots of 
?? + c;t + d;, and define the function 


g(x) = (Ajx "+ Apex” + +++ + Ajn,Je*cos Bx 


(ix se SFO ee = ees 4 Bjn, Je" sin BX; 10) 


for arbitrary real numbers Aji, ..., Ajn; 


and Ba, ce 8g Big; 


Then it can be proved that 


(4.4) The general real-valued solution of the homogeneous differential equation 
P(D)y = Ois the sum 


Y= AG) Po Sa ee 


Note that, since m, + +--+ m, + 2m, +--+ + 2n; = n, the number 
of arbitrary constants in the general solution is equal to n, the order of the 
differential equation. 
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EXAMPLE 2. Find the general solution of the differential equation 
(D + 2)(D — 5)3(D? + D+ 1)*y = 0. 


This is an equation of order 8. The polynomial #? + ¢ + 1 is irreducible with 


| 3 l 3 
roots equal to — 5 + Me and — 5 ne . It follows directly from (4.4) 


that the general real-valued solution is 


y= Ce ™ + (Cix’ + Cx + Cie 


mee et C,je" Gos - x + (Cyx + Ce Sin = x, 
for any set of real numbers C), Co,..., Cy. 
PROBLEMS ‘ 


1. Find the general real-valued solution of each of the following differential equa- 
tions. 


dy .¥ = gy 
(a) oe ) oe 
d° d 
(b) me + -0 (g) (D? + 2D +4 6)y = 0 
R = a ae 
eee + 6D + 5)y = 0 ae a i ae ae 
(d) (D’ —2D+ 10)y = 0 (i) ee = 
oe dt? dt — 
2 ne ; d°x dx = 
(ec) 4D + 4D — 3)y = 0 (j) a ee ees 


2. Show by substitution that the function defined by y = (cix + cy)e” is a solution 
of the differential equation (D — r)?y = 0. 

3. Let r be a real number, and c; and cy complex numbers. Prove that, if 
(cxx + cs)e™ is a real-valued function, then c; and cz must both be real. 

4 Letr; = a+ iB andr. = a — iB, where a and @ are real numbers and 6 ¥ 0. 
Prove that, for any two complex numbers c, and cy, if the function 


oe” + ee 


is real-valued, then c; and cz are complex conjugates of each other. 

5. For each of the following differential equations, find the general real-valued 
solution by first finding an equivalent set of first-order linear differential equa- 
tions and then solving these successively to find y. 


et t(D: 2)0D —.3)y = .0. 
(b) (D — 2)(D? — 6D + 9)y = 0. 
(c) (D — a)(D — b)(D — c)y = 0, where a, b, and c are distinct real numbers. 
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6. Find the general real-valued solution of the differentia] equation 
(D* + 4)(D — 3)y = 0 


by solving an equivalent pair of equations. Use the fact that we have already 
derived the general real-valued solution of the second-order, homogeneous, 
linear differential equation with constant coefficients. 


7. Using Theorem (4.4), which gives the general real-valued solution of the 
nth-order differential equation P(D)y = 0, solve each of the following. 


(a) (D— 2)(D+ 1)*y = 0 (g) eo, gl. 
ae 6 dx3 dx? feo. 
dy dy ; 
(b) ——7—+4+ 6y =0 (hyo (24 Fy 0 
dx3 dx 
c) @-3)O4)O=-5y=0 @ @W+)@ +24 HY 6 
(d) D(D'+ 3D —4)y =0 -Pi p0yy = 6 
4 
oo) O12 =ton aa ar, 
dx4 
2 a2 ee d'y d'y dy Sy 
f) (D+ 3) =e (I) ie Ja 


S. Nonhomogeneous Equations. We continue to consider a given real-valued 
function F(x), a given polynomial 


P(t) = [" = ii So eS in + ait a Qo, 


with real coefficients, and the resulting differential equation 


: P(D)y = F(x). (5 


In this section our objective is to develop techniques for solving many 
examples of (1) quickly. This is in contrast to Section 3, where it is demon- 
strated that (1) can always be solved by successively solving first-order linear 
equations. The task of solving all these first-order equations can be extremely 
tedious, and we therefore look for a simpler method. 

The technique to be discussed is based on two premises. The first is the 
fact, demonstrated in Section 4, that one can write down the general solution 
of the associated homogeneous equation 


P(D)y = 0 (23 


immediately, once p(t) has been factored into irreducible polynomials. The 
second is Theorem (4.1), page 640, which asserts that the general solution 
of (1) is equal to the general solution of (2) plus any particular solution of (1). 


SEC. 5] NONHOMOGENEOUS EQUATIONS 649 


Hence the problem of solving (1) reduces to that of finding any one solution. 
As an introductory example, consider the differential equation 


(D? + 4D + 3)y = 3x? + 2x — 6. C3) 


The characteristic polynomial is #2 + 4¢ + 3, which factors into the product 
(¢ + 1)(t + 3). The associated homogeneous equation is therefore 


(Poe OD 2. 0, 
and its general solution, which we shall denote by y,, is given by 
Vu Cie eee 


for arbitrary real numbers c, and cy. To obtain the general solution of (3), 
it remains to find a particular solution y,, and any one is as good as any 
other. If we can find one, it follows by Theorem (4.1) that y, + y, is the 
general solution of (3). i 

Since the derivatives of polynomials are polynomials and since the right 
side of (3) is the polynomial 3x? + 2x — 6, it is natural to seek a polynomial 
solution. Let us set 


Yo = A,x” + Aca = + -:--:-+ Apo, yithud< 54.0, 


and try to find n and coefficients A,,..., 49 so that (D? + 4D + 3)yp = 
3x2 + 2x — 6. Since Dy, is a polynomial of degree n — 1, and D*y, is a 
polynomial of degreen — 2, it follows that (D? + 4D + 3)y, isa polynomial 
of degree n. If this polynomial is to equal 3x? + 2x — 6, for every x, then it 
must be the case that nm = 2. Hence we let 


Ye = AX eae, 
Then 
Dy, = 2ZAX-+bB, 
= 2, 
and so 


(D? + 4D + 3)yp 


2A + 4(2Ax + B) + 3(AX? + Bx + C) 
= 3Ax?+ (84 + 3B)x + 24 + 4B + 3C. 


The right side of the preceding equation is equal to 3x? + 2x — 6, for all real 
numbers x, if and only if 


3 = 3A, 
2 = 8A + 38, 
+6 = 24-+ 48---3C. 
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Solving these equations, we get A = 1, B = —2, and C = 0. The function 
Py = X= ex 
is therefore a particular solution of (3). It follows from Theorem (4.1) that 
Y=), +e = te ee ee 


is the general solution, where c, and cy. are arbitrary real numbers. 
A second example is the differential equation 


(D? + 4)y = 3e°”. (4) 


The characteristic polynomial ¢? + 4 1s irreducible with roots 27 and —2i, 
and the general solution y, of the associated homogeneous equation 
(D2 + 4)y = 0 is therefore given by 


yr, = €1008 2x + Ce Sin 2x, 


for arbitrary real numbers c; and c.. A particular solution y, of (4) will be 
any function with the property that its second derivative plus four times itself 
is equal to 3e”. Since the derivative of an exponential function is again an 
exponential function, an intelligent guess is that a particular solution might 
be a function of the form 
j, = Ae. 

Trying this, we obtain 

Di ae 

Dg Ss 2TAE™, 
and so 

(D? + 4)y, = 25Ae** + 4de* = 29Ae*. 

Obviously, 29Ae** = 3e*” if and only if A = 3%. Hence a particular solution 


of the differential equation (D? + 4)y = 3e°* is 


ee eS 
i nk: ae 


and it is a consequence of Theorem (4.1) that the general solution is 
SA a Vh a Vo = C; COS 2x + C2 sin 2x + se”, 


where c; and c, are arbitrary real constants. 

The method of finding particular solutions used in the above two 
examples is sometimes called the method of undetermined coefficients. For a 
third example, consider the differential equation 


(D2? + 4)y = 7 sin 2x. Bes 
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The associated homogeneous equation (D? + 4)y = 0 is the same as for 
equation (4), and its general solution is 


Yr = €, 008 2x + C2 Sin 2x. 


In attempting to find a particular solution of (5), one might reason from the 
experience of the preceding examples as follows: The right side is the function 
7 sin 2x. Since the derivatives of any function which is a linear combination 
of sines and cosines are functions of the same type, a reasonable candidate 
for a particular solution is some function y, of the form 


Vp = Asin 2x + Bcos 2x. 


However, when we try to determine values of the coefficients A and B which 
will make y, a solution, we find that (D? + 4)y, = 0. This is actually not 
surprising, since any function of this type has already been shown to be a 
solution of the associated homogeneous equation. Hence we must try some 
other form for y,. 

With some ingenuity and willingness to experiment, it is not at all im- 
possible to discover a particular solution to (5). Nevertheless, this example 
serves to illustrate the desirability of analyzing our technique to reduce the 
amount of inspiration necessary. For this purpose, we again consider the 
differential equation (1); 1.e., 


p(D)y = F(x) es 


with given function F(x) and polynomial p(t) of degree n. To apply the 
method of undetermined coefficients, it is necessary that the right side of (1) 
is itself a solution of a homogeneous linear differential equation with constant 
coefficients. Hence in the discussion which follows, we make the assumption 
that there exists a polynomial q(t) of degree m such that q(D)F(x) = 0. 

Such a linear differential operator g(D) is sometimes called an annihilator 
of the right side of (1). For the differential equation (3), a suitable annihilator 
is the operator D’, since 


D3(3x? + 2x — 6) = 0. 
For equation (5), whose right side is the function 7 sin 2x, we have 
D7 sin 2x) = 14cos2x and D?(7 sin 2x) = D(14 cos 2x) = —4(7 sin 2x). 
Hence 

(D? +- 4)7 sin 2x = 0, 


and thus D? + 4 is an annihilator of the right side. Similarly, it 1s easy to 
see that 
(D — 5)3e°* = 0, 


from which it follows that D — 5 is an annihilator of the right side of 
equation (4). 
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Returning to the general case, we first observe that, if y is an arbitrary 
solution of the differential equation (1), then 


q(D)p(D)y = q(D)F(x) = 0. 


That is, every solution of (1) is also a solution of the equation 


q(D)p(D)y = 9, (6) 


which is homogeneous and of order m + n. Let us denote by y,, the general 
solution of (6), and by y, the general solution of the associated homogeneous 
equation of (1), i.e., of the equation p(D)y = 0. It is clear that y, is also a 
solution of (6). We know that y,, contains m + n arbitrary constants and 
that y, contains n. It follows from the form of the general solution of a 
homogeneous linear differential equation with constant coefficients, as pre- 
sented in Theorem (4.4), page 646, that we can write 


Ve = Vn t 4, cine 


where uw contains m arbitrary constants. It will follow that these are the 
“undetermined coefficients’ of the particular solution we are seeking. 

Let y, be a solution of (1); 1.e., y; is some function with the property that 
p(D)y, = F(x). Then y, is also a solution of (6), and so there exists a set of 
values for the n constants in y, and for the m constants in wu such that, with 
these values substituted, we have 


Vi = Va t+ u. 
Hence 
F(x) = p(D)y1 = P(D)On + Y) 

= p(D)y;, + p(D)u 

= 0+ p(D)u 
p(D)u. 
Thus we have proved that there exists a set of values for the m constants in u 
such that, with these values substituted, the resulting function wu is a solution 
of the differential equation (1). Moreover, it can be proved that there is only 


one such set of values. Hence, as the following examples will illustrate, these 
‘undetermined coefficients” are uniquely determined by the equation 


p(D)u = F(x). (8) 


We take for the particular solution y, the function wu specified by equations 
(7) and (8). 


EXAMPLE |. Find the general solution of the differential equation (5), 1.e., of 


(D? + 4)y = 7 sin 2x. 
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As indicated earlier, the general solution of the associated homogeneous equa- 
tion (D? + 4)y = Oils 
Yr = C1 COS 2X + ce Sin 2x. 


Moreover, we have observed that (D? + 4)7 sin 2x = 0, and therefore the 
operator D? + 4 is an annihilator of the right side. Hence we consider the 
homogeneous fourth-order equation 


ee 4D ayy — + ayy = 0. 
The general solution of this equation is given by 


Ve = (Ax + B)cos 2x + (Cx + D) sin 2x 
= Bcos2x + Dsin2x + Axcos 2x + Cx sin 2x, 


for arbitrary real numbers, A, B, C, and D. It is clear that 
Vy, = Bcos 2x -- P sin2x; . 
and we therefore set 
i= ANRC 24 Cx smi 2x: 
It follows that 
Du = Acos2x — 2Axsin 2x + Csin 2x + 2Cx cos 2x, 


and 
D?u = —2A sin 2x — 2A sin 2x — 4Ax cos 2x 
+2C cos 2x + 2C cos 2x — 4Cx sin 2x 
= (4C — 4Ax) cos 2x + (—4A — 4Cx) sin 2x. 
Hence 
(D? + 4)u = (4C — 4Ax + 4Ax) cos 2x + (—4A — 4Cx + 4Cx) sin 2x 


4C cos 2x — 4A sin 2x. 


Setting (D? + 4)u = 7 sin 2x, we obtain 
4C cos 2x — 4A sin 2x = 7 sin 2x. 


Since this equation is to be true for all real values of x, we conclude that 
4C = Oand —44 = 7. Thus C = Oand A = —4. It follows that the func- 
tion uw, with these values substituted for the constants, is a solution of the 
given differential equation. We therefore set 


Vo = eX COS 2X, 
and obtain 


Y= yr t+ Vp = €1 008 2x + co sin 2x — ix cos 2x 


as the general solution. 
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EXAMPLE 2. Find the general solution of the differential equation 
—— + — — 2y = 5e™. 


The characteristic polynomial is #2 + t — 2 = (¢ + 2)(¢ — 1), and the 
differential equation can therefore be written 
(D? + D — 2)y = (D+ 2)(D — Dy = S5e-”’. 
The general solution of the associated homogeneous equation 
(D+ 2p = f= 
is given by 
Vi = Ce Fe 


The right side of the nonhomogeneous equation is the function Se’. 
Since D(S5e~*”) = —2(5e **), it follows that 


(D + 2)5e* = 0, 


and so D+ 2 is an annihilator. We therefore consider the third-order 
homogeneous equation 


(D + 20D + 20D = typ = 4 ee 
whose general solution 1s 
py = (Ax + De +-Ce, 


for any real numbers A, B, and C. Recognizing that Be™** + Ce” = yn, we 
set 


2x 


i= Aw = 


The constant A is evaluated by setting (D? + D — 2)u = Se **. Differ- 
entiating to obtain the left side, we get 


Du = Ae~** — 2Axe™, 


D4 = —2Ae~* — 2Ae * + 4Axe * 
—4Ae * + 4Axe >, 


Hence 


(D? + D — 2)u 


—4Ae”* +. 4Axe 2 ae aye DAxe 
= —3Ae *,; 


We therefore obtain the equation —3Ae ** = 5e *”, which implies that 
A = —8. Hence the function u obtained by substituting this value for A 
is a particular solution. Thus we take 
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and it follows that the general solution is given by 
Y=Wt+yVp = c1e * + Coe” — 3xe, 


for arbitrary real numbers c, and Cp. 


EXAMPLE 3. Solve the differential equation 
D3(D + 2y = 8x + 1. 


The characteristic polynomial is 7°(t + 2), whose roots 0 and —2 occur with 
multiplicities three and one, respectively. It follows that the general solution 
of the associated homogeneous equation is 


Yr = (crx? + Cox + €3)e°? + cye™ 
= (x? + ox+c3+ ce . : 


The right side of the given nonhomogeneous equation is 8x + 1, and the 
operator D? is an annihilator, since D?(8x + 1) = 0. Hence we consider the 
sixth-order homogeneous equation 


D?D(D'+ 2)y = D(D+ 2y = 0, 
the general solution of which is 
y= Ax + Bx 4 er =] Dx} £ + Fe. 
It is obvious that y, = Cx? + Dx + E+ Fe’, and we set 


u = Ax*+ Bx’. 
It follows that 
De =e Ss SES 
D?u = 12Ax? + 6Bx, 
D3u = 24Ax + OB, 
D'n = 2AA, 


and so 
D*?(D + 2)u = D*u + 2D u 
= 244 + 48Ax + 12B 
= 484x + 244 + 12B. 


Setting D?(D + 2)u = 8x + 1, we obtain the equation 
48Ax + 244 4+ 12B = 8x4 l, 
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which is true for all real values of x if and only if A = 4 and B = —4. It 
follows that a particular solution of the differential equation 


D3(D + 2)y = 8x + 1 
is defined by 


and the general solution is, therefore, 
Y= Vat Vp = CX? + cox + 3 + Ce + GX — 2x*, 


for arbitrary real numbers c, C2, and c3, and c4. 

The method of undetermined coefficients which we have studied in this 
section is not applicable to all linear differential equations with constant 
coefficients. For example, it will not work for the equation (D? + 2)y = tan x, 
because there is no polynomial g(t) with the property that q(D)tan x = 0. 
Of course, this equation can be solved by replacing it by two first-order 
linear equations and solving these successively as in Section 3. It can also be 
solved by another well-known technique, called the method of variation of 
parameters, which we shall not discuss in this book. Finally, it is important 
to realize that there exist tables in which particular solutions of the equation 
p(D)y = F(x) are tabulated for a variety of functions F(x). In particular, 
see pages 112 to 114 of the book by E. J. Cogan and R. Z. Norman, Handbook 
of Calculus, Difference and Differential Equations, Prentice-Hall, 2nd ed., 
1963. | 


PROBLEMS 


1. For each of the following differential equations, a particular solution can be 
found by inspection. Obtain sucha solution y,, and also find the general solution. 


(a) (DE 3D.— 10y=4 
(b) (D’ + Ly = 2x 

2 
(c) es Shy ie en 
dy 


o ax? 


12 y= Se eee 
dx 


fe) (1D = 3p = oy =e 


dy dy Br 
f as ay a 
©) dx? : dx ‘4 2 


(g) D(D* — 9)y = 2e” 
(h) (D’ + 4)y = 3sinx 
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nN 


6. 


(i) (D+ 4)y =3sinx+4x4+ 8 


3” _ 5cosx — 5sinx 


©) dx? dx 


(k) (D> + 3)y = 5cos 3x 
(l) (D’ + 2D — 2)y = 13 cos 2x. 


. Find the particular solution f(x) of the differential equation (D? + 1)y = 2x 


which has the property that f(0) = 3 and f’(0) = 2. (Hint: Find the general 
solution first and then apply the given boundary conditions to find the values of 
the constants.) 


. Find the particular solution y(x) of the differential equation 


dy dy ox 


with the property that y(0) = 4 and y’(0) = 3. 


. The current 7 in a given alternating-current circuit is a function of time ¢ and is 


governed by the differential equation 


de oe = 


Find i as a function of ¢t, if i = 0 and < = Gwen f = 6. 


. Find the general solution of each of the following differential equations. 


(a) (D—-2)y=4x -5 (h) D(D+3)y = 5x —2 
2 
(b) = 3) + 2)y = 4x + 3 (i) a + 4y = 5 cos 3x 
dy dy rae we one 
(c) =: a i ZY =e (j) 1 + 9y = 2s1n 3x 
(d) D(D —.2)y = 6x’ + 2x — 6 (k) (D’ + 1)y = 10sinx + 3e” 


ee + — — 2)y = 6e ” (1) (D’ + 1)y = 4sinx + 8cosx 
oe + D — 2)y 


(g) (D + D—2)y 


be 2 oe ee Ae) (DS Dy =) Se’sin x 


be Ae) CD 4 OD Dy. =! 3e" cos x. 


Hyperbolic Functions. In solving linear differential equations, we have 


encountered many combinations of e”1* and e’2*. Among these, two particular 
linear combinations occur sufficiently often that they have been given special 
names. These are the two functions $e” + 4e-” and ge” — ge”. 
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Let us look at some of the properties of these two functions, which 
motivate their names. First, we observe that each is the derivative of the 
other: 


(te + 3e°*) = 


bole 
ian) 
| 
hole 
Lay) 


d 
dx 
d x 


1 7% ree cs eS OT stipe near 
qe — 3) 


This fact implies, of course, that each function is its own second derivative. 
There is a clear analogy here with the trigonometric functions cosine and 
sine, each of which is, up to sign, the derivative of the other and each of 
which is the negative of its own second derivative. 

The result of squaring these two functions is 


(he? + he*y = de + E+ de™, 
(Je? — fe“ = de — E+ 4e™, 
from which it follows that 
(Re? + $e77)’ — Ghe* + de")? = IL. (1) 
Thus the difference of their squares is equal to 1, and this fact is analogous to 


the trigonometric identity cos’x + sin’x = 1. It is a consequence of equa- 
tion (1) that, for every real number f, the ordered pair 


(x,y) = Ge’ $e gee 
satisfies the equation x? — y? = 1 of an equilateral hyperbola. Similarly, we 
know that, for every real number ¢, the ordered pair 


(x, vy) = (Os tea 


is a point on the unit circle x2 + y? = 1. With this motivation, we define the 
hyperbolic cosine, abbreviated cosh, and the hyperbolic sine, abbreviated 
sinh, by setting 


(2) 


for every real number x. 


It is trivial to verify that 


cosh(— x) = cosh x, 
(6.1 ) 


sinh(—x) = —sinhx, for every real number x. 
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Thus, like their respective trigonometric counterparts, the hyperbolic cosine 
is an even function, and the hyperbolic sine is an odd function. 
Equation (1) now becomes the identity 


(6.2 ) cosh*x — sinh?x = 1, _—_for every real number x, 


and we have also already established the two derivative formulas 


(6.3 ) nes cosh Y= anh x, 

dx 
and 

d . 
(6.4) — sinh x = cosh x. 


dx 


Since each of the two functions, cosh and sinh, is equal to its own second 
derivative, each is a solution of the differential equation (D? — l)y = 0. 
More generally, the functions cosh kx and sinh kx, where k is an arbitrary 
real constant, are both solutions of the differential equation 


0? akan 78. (3) 


From the linearity of the differential operator D? — k? it follows that the 


function defined by 
y = c, cosh kx + cz sinh kx, 3 


for any two real numbers c; and cy, is also a solution. In fact, (4) is an 
alternative form of the general solution of the differential equation (3). 

To prove this fact, let yo be an arbitrary solution of (3). The character- 
istic polynomial is t2 — k?, which equals the product (¢ — k)(t + k). Hence 
there exist real numbers A and B such that 


Yo = Ae”* + Be **. 


However, we have 


kz kz ka ma 


re ee ee Ae ee. 
cosh kx + sinh kx = 5 — 5 “fh 5 5 a 
ent ett eft aoe ‘ 

cosh kx — sinh kx = es ae a, eae “+ ig = 
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It follows that 
Yo 


A(cosh kx + sinh kx) + B(cosh kx — sinh kx) 
= (A + B)coshkx + (A — B)sinh kx, 


which is of the form of (4). This completes the proof. 

In drawing the graphs of the hyperbolic functions, we make use of the 
fact that cosh is an even function, and sinh is an odd function. In addition, 
each of the following simple results follows quickly from the definition of the 
relevant function: 


(i) ces 0 = 1, 
Gi}; sinh = 0, 
(6.5) (ili) cosh x > 0, for every real number x, 
(iv) sinh x = 0 if and only if x = 0, 
| (v) sinh x > 0, for every x > 0. 


Applying these facts to the first and second derivatives, we conclude that the 
graph of sinh x has positive slope everywhere, has therefore no local maximum 
Or minimum points, and passes through the origin with slope 1. Moreover, it 
is concave upward if x is positive, is concave downward if x is negative, and 
as a result has one point of inflection at the origin. Similarly, the graph of 
cosh x has positive slope if x is positive, negative slope if x is negative, and 
one critical point at (0, 1). It is concave upward everywhere, from which it 
follows that there are no points of inflection and the critical point at (0, 1) is 
a local minimum. The graphs of the two functions are drawn in the same 
xy-plane in Figure 2. 

The curve which is the graph of the equation y = cosh x is called a 
catenary. More generally, a catenary is the graph of an equation of the form 


x : 
y = acosh (*) , where a is a nonzero constant. The word comes from the 
<a 


Jatin word meaning ‘‘chain,” and it can be shown that, if a chain or cable 
with a uniform weight per unit length is suspended between two points, then 
it hangs in the shape of a catenary. 

In a manner completely analogous to that for defining the other four 
trigonometric functions from the sine and cosine, we define four other 
hyperbolic functions. They are the hyperbolic tangent, denoted by tanh; the 
hyperbolic secant, denoted by sech; the hyperbolic cosecant, denoted by csch; 
and the hyperbolic cotangent, denoted by coth. The definitions are 


(5) 
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Figure 2 


In the problems at the end of the section you are asked to find the derivatives 
of these functions. These derivative formulas, and also the many identities 
among the hyperbolic functions, are all closely akin to those for the trigo- 
nometric functions. 

The inverse hyperbolic functions are also defined. For example, y is the 
inverse hyperbolic cosine of x if and only if x is the hyperbolic cosine of y. 


That is, 
y =arccoshx  ifand only if x = cosh y. 


_ The domain of the inverse hyperbolic cosine arccosh is the set of all real num- 
bers greater than or equal to 1, and the range is chosen to be the set of all 
nonnegative real numbers. The definitions of the other inverse hyperbolic 
functions follow the same pattern. 
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PROBLEMS 


1. Find the following derivatives. 


d d 
(a) = cosh 5x (f) Tx sech x 
d 2 — d 
(b) — (cosh’3x + sinh’3x) (g) —cschx 
dx dx 
d 9 d 
(c) rs In cosh(x” + 1) (h) ye coth x 
oe ——— Pas 1 
pa 2 aS aes Ss 
(d) ee sinhy/ x2 + 1 (i) i tanh TLP 
d d ‘ 
(e) We tanh x (Gj) A cosh (:) 


2. Find the following integrals. 
(a) | sinh 7x dx (f) | 2x sinh(2x” + 1) dx 


tanh’2x sech’2x dx 


(b) | cosn 4 a (g) 


coth x In sinh x dx 


(d) | tanh x dx (i) | cosh’x dx 


(c) | sinh 3x cosh’3x dx (h) 


2 
sech’ x 


(e) a (j) 


sinh>x dx. 
tanh x 


3. Prove the following identities. 


(a) cosh 2x = cosh?x + sinh?x (c) 1 — tanh?x = sech?x 
(b) sinh 2x = 2sinhxcoshx (d) coth*x — 7 ="taere 
4. Prove that cosh x is an even function and that sinh x is an odd function. 


5. Find the general solution of each of the following differential equations in 
terms of the hyperbolic functions. 


2 
(a) ad = fy (d) (D” _ k*)y =x-+sinx 
dx? 
(b)... LD. = Dy 0 (e) (D° — 16)y = Ssinh 8x 
2 2: Tf 
() dy _ emer (f) (D° — 16)y = 5cosh 4x. 


dx? 


10. 


es. 


12. 
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. Prove that sinh x = 0 if and only if x = 0. 


. Find lim 


sinh x 


cosh x 


rI— 2 


. Identify and draw the curve defined parametrically by 


x(4) =‘cosh¢ 
y(t) = sinh?, —-o <t< ow, 


. (a) Draw the region R bounded by the x-axis, the hyperbola x? — y? = 1, 


and the straight line joining the origin to the point (x, y) on the hyperbola 
defined by x = cosh?¢ and y = sinh zs, for an arbitrary t > 0. 


(b) Compute the area of the region R. 


Find the arc length of the graph of the equation y = 3 cosh (;) from the 
point (0, 3) to the point (6, 3 cosh 2). 3 


ee 


Compute the following derivatives. 


(a) ri arctanh x (b) 4 arcsinh x. 
dx dx 


Sketch the graph of the following equations. 
= y = tanh x (>) y = arccosh x. 


APPENDIX A 


Properties of Limits 


In this appendix we shall establish the fundamental properties of limits 
stated without proof in Theorem (4.1), page 32. Before restating the theorem 
and giving the proof, we recall one of the basic facts about inequalities and 
absolute values, which we shall use. Called the triangle inequality, it asserts 
that, for any two real numbers a and 5b, 


ja + 5] < Jal + [4]. 


This result is stated and proved for a + 6b in (1.4), page 7. It holds equally 
well for a — b, since 


ja — 0 = a+ (8) Se ee 


The theorem, which we shall prove, is the following: 


THEOREM. /f lim f(x) = 6b, and lim g(x) = bo, then 


wt—a 


GQ) lim (f(x) + 8@)] = br 
Gi) Ir pfx): = 203. 
(in) lim fx )e(x) = bee 


(iv) ee /~ = i‘ provided bz ¥ 0. 

According to the definition of limit, the hypotheses tell us that, for any 
positive number e, there exist positive numbers 6, and 6, such that if x is in the 
domains of both f and g and if 0 < |x — a] < 6, and 0 < |x —a| < &, 
then | f(x) — b,| < € and |g(x) — bs| < €. Where it is relevant in the proofs 
which follow, we shall assume without explicitly stating it the condition that 
x lies in the appropriate domain of f or g (or both). 


Proof of (i). Let € be an arbitrary positive number. Then there exist positive 
- 
: 2 
and |g(x) — b.| < 5 whenever 0 < |x — al < 6. (It is legitimate to write 


numbers 6; and 6, such that | f(x) — b;| < = , whenever 0 < |x — al < &, 


664 


APP. A] PROPERTIES OF LIMITS 665 


5 in these inequalities, since the definition specifies the existence of 6’s for any 


- : ; € 
positive number e. Given a choice of €, we can then take = to be the number 
which implies the existence of 6, and 62.) We set 


= minimum {6, 62}. 
Let us now suppose that O < |x — al < 6. It follows that 0 < |x — al < 4 
- 


and0 < |x — a| < 6, and thence that | f(x) — 6i| < and lg(x) — b| < 5 


Clearly, 
ILf() + a(x)) — [b1 + b2]] = IEF) — 51) + [e(x) — J]. 
Hence, using the triangle inequality, we obtain 


L(x) + g(x)] — [br + ball < [f@) — bil + I) — Bo] <5 +55 © 


Thus we have shown that, for any e > 0, there exists a 6 > O such that, 
whenever 0 < |x — al < 6, then |[f(x) + g(x)] — [b1 + &:]| < €. By the 
definition of limit we have therefore proved that 


lim [f(x) + g(x)] = 61 + b:, 
which is the result (1). 


Proof of (ii). Suppose first that c = 0. Then cf is the constant function with 
value 0, and cb; = 0. Hence 


\ef(x) — cbi| = |0 — 0] = 0, 


for every x in the domain of f. Thus, for any two positive numbers € and 6, 
it is trivially true that 


lcf(x) — cbi| < € — whenever 0 < |x — a] < 4, 


and (11) is therefore proved in this special case. We next assume that c ~ 0, 
and choose an arbitrary positive number e. There then exists a positive 
number 6 such that 


|f(x) — bil < Te ’ whenever 0 < |x — al < 6. 


It follows immediately that 
é 
lef(x) — cbi| = |eL f(x) — bill = lel [f@&) — 41] < le icra 


whenever 0 < |x — a| < 6. This completes the proof of (ii). 
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Proof of (ii). Let € be an arbitrary positive number. Select a positive number 
M such that |b,| < M and |b.| < M. Then there exist positive numbers 
61, 62, and 63 such that 


If(x) — bi <5 oe? & provided <x ——-al < oy 


lg(x) — bo] < provided 0 < |x — al < 52, 


aM’ 
lg(x) — b| < M—|b.|, provided 0 < |x — al < 63. 
We set 


6 = minimum {6}, 62, 63}, 
and in the remainder of the argument we assume that 0 < |x — aj < 6. It 


then follows that all three of the above inequalities hold. Using the last one 
together with the triangle inequality, we first observe that 


le(x)| = |(g(x) — be) + be] < |g) — be] + |b] < (M — |b2|) + |b.| = M 
Next we obtain 


f(g) = b,b,| = |f(x)g(x) — big(x) + big(x) — b,b,| 
= |g(x)Lf(x) — bi] + bilg(x) — 5] 
< |gQOL fC) — b1]| + bilge) — 52]| 
= |g(x)| [f@) — bi] + {bil Ig) — 42]. 
Finally, therefore, 


€ € €  € 
If@)g(x) — bibs] < Maa, + Mae = 5414 5 = & 
and the proof of (iii) is finished. 
Proof of (iv). We shall prove the simpler statement: 
lim Me sets, provided b, ¥ 0. CE) 


zr—a 2(x) bs 


This fact, together with (iii), then implies 


ae a 
in aa as ee 


which is the result (iv). Since it is assumed that b, ~ 0, there exists a number 
m such that 0 < m < |b,|. Hence there exists a positive number 6, such that 


Ig(x) — bel < |be| — m, 
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whenever 0 < |x — a] < 6;. But 
bo] = |—bo| = |(g(x) — be) — g(r) 
< |g) — 52| + |g@)]. 
Hence, if 0 < |x — al < 61, we have 
le(x)| 2 |be| — |g) — 52] > [be] — ([d2| — m) = m. 
Taking reciprocals, we therefore obtain 


] ] 


———- <—> whenever 0 < |x — aj < 61. 
Ig(x)| am 


Now let € be an arbitrary positive number. There exists a positive number 6, 
such that 
g(x) — by] < mlble, whenever 0 < |x — al < &:. 
We set ; 
§ = minimum {6), 69}. 


It follows that, if 0 < |x — a| < 6, then 


he a hae Se 
g(x) b» bog(x) 
l 1 
i ae aie - 
eG Toa (BO? ~ 
le) — 5, 
Af) \bo| Ex : 
] 
m|bo| mlbs\€ = €. 


Thus (1) is proved, and, as we have seen, (1) and (ili) imply (iv). This com- 
pletes the proof of the theorem. 


APPENDIX B 


Properties of the Definite 
Integral 


Five basic properties of the definite integral are listed at the beginning of 
Section 4 of Chapter 4. Of these, two are proved in the text and one is left 
as an exercise. The remaining two will be proved here. 

Let f be a function which is bounded on a closed interval ia.-b|. - This 
implies that [a, b] is contained in the domain of f and that there exists a 
positive number B such that | f(x)| < B for all x in [a, b]. We recall that, for 
every partition o of [a, 6], there are defined the upper and lower sums for rs 
relative to o, which are denoted by U, and L,, respectively. Moreover, it has 
been shown (see page 168) that 


= =, Like < Usa = vs, ( ee, 


for any two partitions o and 7 of [a, b]. The function J is defined to be inte- 
grable over [a, b] if there-exists one and only one number, denoted P J, with 
the property that 


b 
nef fe U,, 


for any two partitions o and 7 of [a, b]. It is an immediate consequence of 
this definition and the inequalities (1) that /f is integrable over [a, b] if and 
only if, for any positive number e, there exists a partition o of [a, 5] such that 
U, — L, < . A similar corollary, which we shall also use in the subsequent 
proofs, is the statement that fis integrable over [a, b] and £ f = Jif and only 
if, for every positive number e¢, there exists a partition o of [a, b] such that 
On = J; <eand J = Ll =< €. 

The first property of the definite integral, which we shall establish in this 


section, is presented in the following theorem: 


THEOREM 1. The function f is integrable over the intervals [a, b] and [b, c] if 
and only if it is integrable over their union [a, c]. Furthermore, 


[t+] 
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Proof. We first assume that f is integrable over [a, b] and over [b,c]. Let 
€ be an arbitrary positive number. Then there exists a partition a, of [a, 5], 
and a partition cg, of [b, c] such that the following inequalities hold: 


mo fda [fee 


ae 


It follows from these that 


Ca eee.) oe 
(Uf r+ [A-a+tn 


Let us set ¢; Ug. = o. This union is a partition of [a, c], and it is obvious 
that 


we 


€ 
S93 


€ 
6 


< €. - 


ES i Es ai 2 
Lis & by. 


lu -(f r+ [nl <e 
(Ls+[-% 


These inequalities imply that fis integrable over [a, c] and also that 


Peal: 


It remains to prove that, if fis integrable over [a, c], then it is integrable 
over [a, b] and over [b, c]. We choose an arbitrary positive number e. Since 
f is integrable over [a,c], there exists a partition o of [a,c] such that 
U, — L, < e. Let us form a refinement of the partition a by adjoining the 
number b. That is, we set 


Hence 


< €. 


eee {be 


(It is, of course, possible that o already contains 6, in which case a’ = @.) 
Then 
is % Lge < U,: x Us, 
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from which it follows that U,, — L,, < e. But, since b belongs to o’, we 
can write 0’ = 0; UG», where oj is a partition of [a, b] and a> is a partition 
of [b, c]. Moreover, 

U,: a ie a Uigess 


Ly = pee + Eee: 
Hence 


(Ue, = Lea) U3, — Leg ie 3S 
Since U,, — L,, and U,, — L,, are both nonnegative, it follows that 
Ogg, Stake eG, 
Uy, degs = 


The first of these inequalities implies that fis integrable over [a, 6], and the 
second that f is integrable over [b, c]. This completes the proof of Theorem 1. 
The second result to be proved is the following: 


THEOREM 2. If f and g are integrable over [a, b], then so is their sum and 


b b b 
futo-f refs 


Proof. Let € be an arbitrary positive number. By taking, if necessary, the 
common refinement o; U a2 of two partitions of [a,b], we may select a 
partition a of [a, 6] such that 


b b 
Lact Ura 
oa (9) M2 
U; [< fs ae <. 


where U“? and L” are, respectively, the upper and lower sums for / relative 
to o, and U and L™ are the same for g. We conclude from the above in- 


equalities that 
b b 
up Uy) = (| ft | s)i<e (2) 


b b ; 
Cf yay s) - UP + L)| <e (a) 


Let [x;_1, x;] be the ith subinterval of the partition ¢. We denote by M‘\” 
and M{” the least upper bounds of the values of f and of g, respectively, on 


APP.B] PROPERTIES OF THE DEFINITE INTEGRAL 671 


[x;_1, x;], and by m\? and m” the analogous greatest lower bounds. Then 
mP + mY < f(x) + g(x) < MiP + M?, 
for every x in [x;_1, x;]. It follows that 
my? a m2? =< mito A Mt < MP4 M;, 


where mt and MY* are, respectively, the greatest lower bound and the 
least upper bound of the values of f + g on [x;_1, x;]. By multiplying each 
term in the preceding chain of inequalities by (x; — x;—1) and then summing 
on i, we obtain 


LY na Lo “ Lt = Ute < Sage = yo (4) 


where U‘*” and LY*™ are the upper and lower sums, respectively, for 
f+ grelative to a. The inequalities (2), (3), and (4) imply that 


er -(['re [lee 
([[r+ [Le -0 


It follows from these two inequalities that the function f + g is integrable 


over [a, b] and that 
b b b 
| s+ea-| rt] g. 


This completes the proof of Theorem 2. 


-€. 


APPENDIX © 


Equivalent Definitions of the 
Integral 


The purpose of this section is to prove that the definite integral t Sf 
defined on page 169 in terms of upper and lower sums, can be equivalently 
defined as the limit of Riemann sums. The fact that these two approaches to 
the integral are the same is stated without proof in Theorem (2.1), page 414, 
and we shall now supply the details of the argument. The “if” and the “‘only 
if’ directions of the proof will be treated separately. 

Let f be a real-valued function which is bounded on the closed interval 
[a, b]. This implies, according to our definition of boundedness, that [a, b] 
is contained in the domain of f. Leto = {xo,...,X,} be a partition of [a, b] 


such that G=x%% =i = 


If an arbitrary number x; is chosen in the ith subinterval [x,;_1, x;], then the 
sum 


Re =D) MPG: = xi) 


is a Riemann sum for frelative to o. The fineness of a partition o is measured 
by its mesh, which is denoted by |||] and defined by 


|o|| = maximum {(x; — x;_1)}. 
oe oe 


The first of the two theorems is: 


THEOREM 1. If f is bounded on [a, b] and if lim R, = L, then f is integrable 
over [a, b] and LF = 2. |o|0 


Proof. We assume that a < 5, since otherwise L = 0 = f- f and the result 
is trivial. It is a consequence of the definition of integrability that the con- 
clusion of Theorem 1 is implied by the following proposition: For any 
positive number e, there exists a partition o of [a, b] such that, where U, 
and L, are, respectively, the upper and lower sums for frelative to a, then 
|U, — L| < e and |L — L,| < e. It is this that we shall prove. 
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We first prove that, if U, is the upper sum for f relative to any partition 
o of [a, b], then there exists a Riemann sum R for f relative to o such that 
|U, — R‘| is arbitrarily small. Let o = {xo,..., Xn} be the partition with 
the usual proviso that 


G= 425 Mees = SU = D, 


and let € be an arbitrary positive number. For each i = 1,...,n, set M; 
equal to the least upper bound of the values of fin the subinterval [x;_1, x;]. 
Then there exists a number x; in [x,;_,, x;] such that 

(6 Mie 


Hence 


0 < Mix: — X11) — FOI: — x) S FG OH — 20, 
and so 


0< 3 Pe X11) — du f (xi (ei — xi) < as 2 OG = 25-4): 


However, 


ss M(x; — xX) = Uz, 
A 


€E 


ob ee a) 4 z = X;1) = 6 & a) Ge a) = 5 : 


Moreover, >-"_,f(x;)(x; — X;_;) is a Riemann sum for f relative to o, 
which we denote by RS”. Thus the preceding inequalities become 


oe fy < 5: (1) 


which proves the assertion at the beginning of the paragraph. 

In an entirely analogous manner, we can prove that, if L, 1s the lower sum 
relative to an arbitrary partition o of [a, b] and if € is any positive number, 
then there exists a Riemann sum R® such that 

Oop ba, 5 (2) 


We are now ready to use the premise of Theorem 1—the fact that 
lim R, = L. Let € be an arbitrary positive number. Then there exists a 


\|o||0 
positive number 6 such that, if o is any partition of [a, b] with mesh less than 
6, then 


€ 
fe Shy? 
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for every Riemann sum R,. Accordingly, let o be a partition of [a, b] with 
I|o|| < 6, and let U, and L, be, respectively, the upper and lower sums for f 
relative to this partition. It follows from the preceding two paragraphs that 
there exist Riemann sums R“"’ and R for f relative to o such that 


_— ph .. 
| Us ig | < yi 
(oi = 
ne L,| < : 
[see inequalities (1) and (2)]. Since the mesh of a is less than 6, we have 


= =o 
Rr 


IL — RP = |R® Bh = 


Nl] an 


Hence 
[U, — L| = |(U, — RS’) + (RP — L)| 


< |U, — RO| + [RO — © 
E E 


a 


oe 


and, similarly, 
IL — L,| = |Z — RS) + (RP — L,)| 
< |L— RP | + [RP — L,| 
€  € 
a 9) + 4 = €. 


Thus both |U, — L| and |Z — L,| are less than e, and the proof of Theorem 1 
is complete. 
The converse proposition is the following: 


THEOREM 2. If f is integrable over [a, b], then lim R, = im - 


I|o||0 


Proof. We assume from the outset that a < b. Let € be an arbitrary positive 
number. Since fis integrable, there exist partitions of [a, b] with upper and 
lower sums arbitrarily close to iE f. By taking, if necessary, the common 
refinement o U7 of two partitions o and 7 (see the inequalities L, < Loy, < 
Usy, < U, on page 168), we may choose a partition 9) = {Xo,..., Xn} of 


[a, b] such that 
b 
é 
Oe a | i Xo 5 > 


b 
€ 
| I~ ta <§ 
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The assumption of integrability implies that the function fis bounded on 
[a, b]. Thus there exists a positive number B such that |f(x)| < B for every 


x in [a,b]. We define : 


a 


Next, let o be any partition of [a, b] with mesh less than 6. Consider the 
common refinement 0 U do. Since 


b 
i, = Loativs % | a Uoliies = Vans 


we have 
b 
E 
eee deo (3) 


b 
Pia =} (4) 


The partition o U a» may be regarded as having been obtained from a by the 
addition of at most n — 1 new points of a». Hence at most n — 1 of the 
subintervals of o have been further partitioned by the inclusion of points of 
g, in their interiors. Each of these further partitioned subintervals has length 
less than 8. It follows that, on each of them, the contribution to the difference 
U, — Uso, is less than the product 6(2B). On those subintervals of o which 
have not been hit by points of a» in their interiors, the corresponding terms 
of U, and of U,y,, are the same. We conclude that 


Us = O siteg = (n Ses 1)6(2B), 
and, similarly, 
Lsieg hee = eS 


However, 
é é 
Hence 
. 
Es ae UO siies, =. i] : 
é 
Pitas = Lig x a : 


Combining these inequalities with (3) and (4), we conclude that 
b 
lis = | 2 < 6, (33 


b 
bits (6) 


for every partition o with mesh less than 6. 


676 EQUIVALENT DEFINITIONS OF THE INTEGRAL [app.c 


Finally, let R, be an arbitrary Riemann sum for f relative to a partition 
o of [a, b] with mesh less than 6. We know that 


lL, < U, 
(see page 413). These inequalities together with those in (5) and (6) imme- 


diately imply that 
b 
|Rz — | f\ <<. 


Hence lim R, = [ f, and the proof of Theorem 2 is complete. 


I[o0 
The conjunction of Theorems 1 and 2 is equivalent to Theorem (2.1), 
page 414. We have therefore proved that the definite integral defined in 
terms of upper and lower sums is the same as the limit of Riemann sums. 


Answers to Problems 


Students sometimes make the mistake of assuming that a printed answer 
must be correct. Unfortunately, this is not always true in first editions. The 
authors will appreciate being notified of any errors which are discovered. 


CHAPTER 1 
Section 1, pages 8-9 
1. (2,4) is open, bounded; [3, 5] is closed, bounded; (—%, —2] is closed, un- 
bounded; [1.5, 2.5) is neither, but bounded; (2, 7) is open, bounded. 


meee) xX > 20rx < —2 
(e) —3<y<-—-lorl<y<3 
mee) ee 2 <3 
(g) x #a 
(h) —4<u<O0o0r2<u<6 


8. yes 


Section 2, pages 17-19 
2. The point (a, b) is symmetric to the point (4, a) around the line x = y. 


3. Use the signs of a and b. 

nS, 4/2, 24/5/20 5. 

8. (a), (b), (c), (d), (g), G), GJ), (kK), (1) are functions. 

9. (a) (2 =7,f/0) =1,f@=e@+a-+l, 
fat+b)=@4+6042ab+a+5+1, 
Kla-b=@+4+ 8h —2ab+a—b+1. 

(b) g(0) = —1, g(—1) = 0, g(10) = 34, 
P46 or a | 
g(5 +t) = ——— a a 
10. (a) f(x) = x, g(x) = a= x, a(x) = a 
(c) fx) = x, a(x) = x+1 (d) f(x) = Vx, g(x) = v |x| 
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fa) xt 3 (b) x¥ +vV2 (c) x # 3, —4 
(d) R ie) <5 (ft) x ]=—222 0 
(g) x # 0,9 


Section 3, pages 25-26 


i; 


13. 


1 4 5 1 1 2 
ob ed an -aiE Ea 
x Xx Xx x x xX 


3 2 1 1 
FT 
(d) Pee Ss (e) <-—eeo 

x ee 
. domain of f: R, domain of g: x # 0. 
(a) x #0 (bh) x 4D (c). x sei 


(d) x #0 (ec) x #0 


. (3f)(x) = 3- f(x) for all x in the domain of f. Vf (x) = V f(x) for all x in the 


domain of f such that f(x) > 0. 


.(b) v= V gr = gli 
tay eee think 3S 4% 2G 


(oe) x oe Sie 


3 
Gs ze ae 5S (b) eee SED 42 


(c) ae = 
xX 


. f + gis the set of all ordered pairs (a, b) such that a is in the domain of both 


fand gand bd = f(a) + g(a). fg is the set of all ordered pairs (a, b) such that 

a is in the domain of both f and g, and b = f(a): g(a). 

(a) the set of all real x such that x is in the domain of g and g(x) is in the 
domain of /. 

(b), (c) the set of all real x such that x is in the domains of both f and g. 


Section 4, pages 38-39 


: 


iS (bo) —1 4 (d) 2 
ero fs (g) 0 (h) 5 
(i) 3a2+ 2 G) 4x-1 

. (a) continuous (b) continuous 
(c) discontinuous atx = —1,2 (d) discontinuous at x = 2 
(e) continuous (f) discontinuous at x = 0 
(g) continuous (h) discontinuous everywhere 


(i) discontinuous at x = 1 


3. (a), (d) and (i) have removable discontinuities. 


. (a) O (b) 3 (cc) 4 (d) 3 
. both true 
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9. (a) 


00 (b) does not exist (ae | (d) a 


10. all except (vi). 


11. lim f(x) = bif, given e > 0, there exists 6 > 0 such that if x is in the domain 


rat 


of fand if0 <x —a < 6, then |f(x) — | <e. 


12. f(x) 


14. (a) 


(b) 


=f + |: 
XxX 


lim f(x) = © if, given any e > O, there exists 6 > 0 such that if x is in 


t—<a 


the domain of f and if 0 < |x — a| < 6, then f(x) > e. 
lim f(x) = +0 if, given any e > 0, there exists 6 such that if x is in 


the domain of f and if x < 6, then f(x) > e. 


Section 5, pages 45-46 


1. (a) 
(d) 
(g) 

2. (a) 
(d) 

3. (b) 


(c) 
5. (a) 


(b) 
6. (a) 
(c) 
(e) 
(g) 


rs + 1 (b. y=x+1 (c)'y = —-2x +1 
= - 5) ee - 3 (f) y= —x4+1 
y= —19% i = 4 (i) x =2 
—1 oF ea 
no slope (e) 0 (f) —4 
ae 
If colinear, then y = a 2) x + axb, — aids 
by — ay by eee 5 | 
x-intercept = —+? 
y-intercept = 10 
y=mx+b 
Fay = 3x — 1 (b) f(x) = 2 
p= —aix| +3 (dd) f@®)=x-1 
pV 3 (f) fG) =x 
or ae (h) f(x) = —§x +4 


linear functions: (a), (b), (d), (f), (h) 


7. (a) and (d) are parallel; (a) and (b), (b) and (d), and (e) and (f) are perpendicular. 


8. (a) y= —-3x +44 (b) 2y+5x =5 
fm oy — 2x = 13 

Section 6, pages 55-57 

t. {4) 6 (b) 6a 

2 4fa)-y = 6x + 1 (b) y = 6ax — 3a*+4 

3. —a°5 

4. (a) 3a (b) 2a + 3 (c) 0 
ee (f) 3a + 6a+3 


2a a’ 
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a —a l 
"(eet -@ Fu" O) aan 
a) re (b) y=5x4+4 
(cc) y=7 in; 9 xc faw% acd 
a a? 
6. ta) 2% 
7.-t6) 1,1 
9. (a) 1 (ob) 27 (c) 34°>+ 66+ 3 
~ - 
(d) aay (e-) 2a+3 eo 4— S 
he) 2 (b) 64 (ce): 132% 


12. Neither f(x) = x! nor g(x) = |x| has a derivative at x = 0. 


Section 7, pages 66-68 


1. (a) 6x +4 (b) 3x? + 2x 
(c) 2x?(x + 2) + 3x°(x + 2) (d) 4x? + 6x? — 2x — 8 
1 4 
ey Ax (f) @— xp 
2 4 saat - 
(g) a (h) SK 
(i) —2G6 — x) . 2 2 
3+ x) G) Gx +a 
Ge do 2x 4 4) =e 
(k) et (1) Ge + 1p 


(m) 2 (. — Ly 
x 


(1) @-— Webs & -— Oa — +O =e 
2y=-—-2x+4 
3. y= 2x7 — 6x + 4 
5. (a) (HAAS) = fihhh + Afthh + Abit + ALS 


6. 28x — 25y = 16 


74ay- 7 (b) gee 
x 
2 
(c) & — 1)? (d) -Z 
(e) = (f) 6u+4 
a 
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2 ta) Ge + 2, 6 (5) 5, 0 


3 2 
(c) ee eee 


(d) 5¢° — 32’, 200° — 61 (ae = 2y Ga 4 6x * 
© —* —12s° + 4 
= = I) is a i 
eee = —1 bP = 3,c = 0 
—4 
(c) 6a (d) —24a~™ 
a ea 
41 + 4 - day i 
3 — 
(: ae, V2) _ yy v3") 
wees 6x for x > 0, —x for x < 0 (b) yes 
oe t for x > 0, —lforx < 0 
14. (a) $x" (b) yes () San 
meen oo (x) = 2x for x < 1, =2 forx > 1 
ec) = 2ior x 2 1, S0 for x > 1 
(c) gis differentiable; g’ is not 
Section 8, pages 75-76 
Pee) 10x(x + 1) (b) 4(x° — 2x + 3) — 1) 
oyt — x’) 2x 
G+ iy CEs, 
2.5/3 16x(x” — 1) 
i Oe = A 


(g) (22 + 3x + 2) + 3)(2x + 3) 

2. (a) 20x(1 + x’) 
(b) 60! + 3x° + 2x? + x + 440? + 9x" + 4x + 1) 
4dr + 1S Or — 3;OR=a) 


Be 
eeaeG’- 1) (e) a : 
oe sy + 1)” @ Bont ih” =a +A) 
(h) —20(u + uw?) — Bu”) 
ix 4-6 
4. [35(2x" — 6)° + 4(2x" — 6)](4x), 17,984 
mii 2 


(@ +552 81 
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Ze 
at 
100 
== 
11. (a) 2 sinh x cosh x (b) sinh x 
Ss ee | (d) cosh (cosh x) sinh x 


13. 678 


10. 


Section 9, pages 81-83 


1. 0 
, 
3 3 
3. (a) —1 (b) three 
Ss _8+2V5 8 — 2V5 
e305 § = 3V5 
es eee (b) q 
5 1 
se. ©) apie 
2a 8 
(e) ba? + 1)2 ae 5 
(g) 1 (h) 76 
(eee GE 1 
oo 
3? 18 
6. (a) V2, -1 (by. =4,4 
2 ees (dys, 45° 
CHAPTER 2 
Section 1, pages 93-94 
1. (a) min (3, —Z (b) max (—1, 4) 
(c) min (3, —2$) (d) max (0, 5) 
(e) local max (—1, 2), local min (1, —2), p.1. (0, 0) 
| 1 26,244 
(f) local min (1, 0), local max & 5. — > p.l. (—2, 0), 
= + 3V6 972(138 + aie 
10 105 oe 
(g) no extrema or p.1. (h) local min (2, 12), p.i. (—~v/16, 0) 
ae K K 
(i) local min (2 —- > local max | — ee ae > p.i. (0, 0) 


(i) min (. =>.) 
V4 W256 
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(k) max (0, a), min (-ta, 0) (1) min (0, 0) 

(m) min (7, 0) (n) no extrema or P.i. 
/3 

(o) local max (0, 3), min (+2, —13), p.1. ( ae > = 


(p) local max (—2, —$), local min (2, —§) 


V3 2V3 V3 V3 
(q) local max (2 _ - ) = 9 : > local min \ 2 + = >— a > p.i. (2, 0) 
(r) local min (—2, —7), local max (2, 9), p.i. (0, 1) 
2 
4. (b) a (c) 9 ee 
4a 


6. joy = —x + 3 
7. f(x) = 4x+3 


8. f(x) = 


9. f(x) = 
Section 2, pages 101-103 


2. 50 yd < 100 yd 
ca 
4. (a) local min (1, 5), local max (—1, 9) 
(b) local min (4, 3%), local max (—1, 10) 
(c) local min (ja\, 2\a|), local max (—|a|, —2\a}) 
(d) local max (2, 27), local min (5, 0), absolute min (0, — 25) 
(e) local min (2, 4), local max (0, 0) 
(f) absolute min (—1, 41), local max (0, 54) 
(g) local min (0, 1), absolute max (2, 27), absolute min (1, 0), local 


see 1 
max { — , —— 
= ee 
(h) absolute max (—4, 2) 
(i) local max (1, 0), absolute min (0, —4), absolute min (3, — 4) 


= 
2 : 
6 


6. absolute min 
ae x + 4 
fae 4 yp = 2 
9-3, 3), (—3, —3) 


mee) (1, 2), (—1,-2) 
(c) 2units X 4 units [the vertices of the rectangle are the points (+1, +2)] 


11. 6 units X 18 units 
12. 1 unit X 262 units 
13. side of square = 6 in. 


14. (a) 3 in. (length of side) (b) = in. 


207 


x+4 


15. (a) Use all wire for the circle. (b) circumference of circle = 


684 


16. 
19, 
20. 
rai 
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Paddle directly to destination. 
3 +,2./2 
350 


no 


Section 3, pages 107-108 


1. (c) 3 (d) 100 ft 

2. Particle starts 100 units to left of origin (¢ = 0,v = 135,a = — 84), moves 
toward origin, slowing down, stops at t = 3 (x = 3, v = 0, a = —24), 
moves left, reaches maximum speed toward the left at ¢ = s(x = 38, 
v = —12,a = 0), stops at t = 3 (x = 43,0 = 0, a = 24), and finally moves 
to right with ever-increasing velocity. 

3. At t = 0 the particle is 288 units below the origin (y = —288, v = 144); it 
moves up, slowing and stopping at tf = 3 (y = 36, v = 0), and finally moves 
down with ever-increasing speed. The acceleration throughout is constant at 
— 32. 

4. Particle starts at x = 5 (v = 0,a = —6), moves left, speeding up. At ¢ = 1 
(x = 2, v = —3, a = 0) it reaches maximum speed to the left and starts 
slowing down, stops at ¢ = 2 (x = 1, v = 0, a = 6), and finally moves right, 
at ever-increasing velocity. 

5. Particle starts at y = 10 (v = —9,a = 0). It moves down until t = 1 (y = 4, 
v = 0, a = 18), then moves up at ever-increasing speed. 

6.0 

8. 25 knots 
25 

9, — 

127 
11. 8 miles/hr, 22 miles/hr 
12. (a) ~16 knots (b) 3:12 P.M. (c) 3 miles 
13. —2 ft/min 
4 
15. — 
Or 


Section 4, page 111 


is 


An kW WN 


(a) 1.993 (b) 4.004 (c) 1.998 
(d) 1.060 (e) 4.075 (f) 2.019 
(g) 0.062 Ch). 31.375 (i) 0.850 
. 0.1008 
5.03 
4.72 
. (a) 367 in.? (b) 39.67 in. 


<a) “296 0? (b) 205.2 12 
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7. 3.92 
8. 31.66 
9. (a) 6.93 (or 4\/3) in®. (b) 7.21 in?. 


10. 0.87 


Section 5, page 115 


1. 


(a) f(8) = f(-1) =f’) = 9 

(b) f(—2V3) = f(0) = fQV3) = f'(—2) = f’(2) = 0 =. 

() f(—4) =f(-1 = fQ) =f(-14 V3) =f'"(-1 — V3) = 0 
(d) f(—4) = fO) =f = f'(—2V2) = f'O) = f’2V2) = 0 

(e) f(2) = f(—2) = 0. f’(%) is never zero, and f’(0) does not exist. 
(f) f(—3) = fGB) = f'(—3) = f'O) = f’B) = 0 


5 V3 
: (a) 2 (b) ge 
() V3 @ ges" 
_(a) 2x” +c, c any constant (b) xi + +2x+e¢ 
1 1 
(c) meee (d) aoe aye * 


Section 6, pages 121-122 


1. 


6. 


= 


(a) (2x + 1) dx (by 7 dx 
Mg (5x — 1) + 156" + 1x = 1) 1 dx 
2 dx 
O° G+ TF 
2 2 
(f) 2v udu = 2u U dav Ss Oe 
dx 
GN rare eee 
3p + 2)do + 2a — Dade 


tx" + 2) dx 


(e) Tu? du 


(i) dx 


' (7x3 + 2x + 1)? 


— (641 + 4) dt 


"(1622 + 27)3 


1007 3 
3 it 


(a) 21607 in.? (Bb) 4629 in.’ (c) 02322 in? 


Section 7, pages 130-131 


:. 


(a) 6 (b) 35 (c) 0 (d) 3 
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V3 
(e) 3 (f) Se 
(g) 3 (i) 2 
1 : 
(d) ¢ 4 i) -=9 
ata) (b) 0 (c) O (d) 2 
5. (a) 0 (b) 
CHAPTER 3 
Section 1, pages 135-136 
l.(a x*+y=9 (b) x? + y? = 190 
(c) x*+y? 4x—4y-—-17=0 Wd x —-6x+y=0 
(ey x? yp + iy = 0 (f) x? + y? + 6x + 6y = 0 
(g) x? + y?— 8k —8y+16=0 (h) +7? 5y=0 
GQ ¢®+y-—6x+44+9=0,xX+y — 14x42 474+ 49=0 
2. All curves are circles, with centers and radii as follows: | 
(a) (0,0), 8 (b) (0,0), 4V2 
(c) (0, 4), 3 pee (d) (=z, 0), - 
(e) 2 =a) V19 (f) Gg, . ’ = 
(g) (4, 6), 5 (h) (§, 3), 4 i 
65 
(i) (3, —3 ’ io (j) 3, —? ’ ie 
6: ta) ~4y = 3x4 25 (he 32 
(c) 12y + 5x = 83 (d) 37 == 5 
7. 4x + 18y + 25 = 0 
Ie SY = DS (b) Ge = 3 er = 


ic) (+ 1% 4+ ty — & = 169 
Section 2, pages 141-142 


1. (a) y? = —8x (bh). x* se 
(c) x? = —4y (d) y? = 16x 
(e) x? = —16y + 32 (f) y? = —16x — 80 
5 = ) »= eee 
Vi yi yi 
By=oxt3,y=ex4+4 
9. ta) (2X5 9) 
10. (a) yw — BK —2y+9=0 (b) x? —2x-—4y+5=0 
(c) y? — 24x — 6y + 105 = 0 (d) x7 +2x+ 8) —15=0 
11. (a) (3,1) (b) (1, 2) 
12. (@@) py? — 4x —2y+1=0 (b) x? — 2x —2y+2=0 


(c) y? — 12x — 6y + 21 =0 (d) x7 +2x+4y—-—11=0 
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13. 


Foci, vertices, and directrices are as follows: 

(a) (0, 2), (0, 0), : ees —Z (b) uf, — 4), es — 1), SS =e | 
(c) G, —35, G, -8),y=—-32 @d ,-1),G,—-1,x = 1 
i +7, —2), (8, —2), x = (f) (—4,0), (-% 9,» =% 


Section 3, pages 149-151 


3 


10. 


11. 


For each curve, the foci, the endpoints of the major axis, and the endpoints 
of the minor axis are as follows: 


a) ©, +/5), (0, +3), (+2, 0) i) (2975-9), (23, 0), (0, +2) 
(c) (+5, 0), (£13, 0), (0, +12) (d) (+6, 0), (+10, 0), (0, +8) 
(e) (+1, 0), (4V17, 0), (0, +4) 

x’ y ~ y 

x y 2 y 

4x” 4y” 5 
(e) 33 es aa 1 


_ All curves are ellipses; for each the center c, endpoints of major axis M.A., 


endpoints of minor axis m.a., and foci f are as follows: 
(a) c (2,4), M.A. (7, 4), (—3, 4), m.a. (2, 7), (2, 1), f (6, 4), (—2, 4) 
(b) c(—3, 2), M.A. (—3, 7), (—3, 3), m.a. (1, 2), (—7, 2), f(—3, 5), (—3, —1) 
(c) c (—5, —2), M.A. (21, —2), (—31, —2), m.a. (—5, 22), (—5, —26), 

f 6, —2), (—15, —2) 
(d) c (0, —3), M.A. (0, 2), (0, —8), m.a. (3, —3), (—3, —3),f ©, 1), ©, =F 
(e) c (—2, 3), M.A. (—2, 6), (—2, 0), m.a. (0, 3), (—4, 3), f (—2, 3 V5) 


8 2b” 
. (a) 3 425 ieee 
a 
( ) = + 9v'5 = v5 (b) = + 18V'5 _ v5 
——). x = 5 gi 3 y= 5 = 3 
2 
.(a) Vaxtec?+y (b) = 
” y 
as 
bx, = b’x1x b’x,? 
_ ayy ee ee ay, ay, 2 
2axi+c 
(a) 2byi+d 
y= = Gax: + ox , Zax tom, 


200; -F a 2by1 + d 
2) f= 
36 a 20 
o-7) OG —4 


es ge 


Gay hy 
Se 


(a) ae. (b) 


688 ANSWERS TO PROBLEMS §[cuap. 3 


12. line segment connecting (—c, 0) and (c, 0) 
ae ee tee ee 


Section 4, pages 159-162 


2. Each curve is a hyperbola. The center c, foci f, vertices v, and asymptotes for 
each are as follows: 
(a)-¢ (0, Ds G5, Oj 0-88, OD, yeocber ot) de 
(b) c (0,0), f (0, +5), v 0, +32), y = + — x 
(c) ec (0,0), f (£13, 0), v (+5, 0), y = +42x 
(d) c (0,0), f (+17, 0), v(£15, 0), y = +7&x 
(e) c (0,0), f 0, +V34), v 0, +3), y = +3x 
(f) c (0, 0), f , 4), (—4, —4), D (2/2, 20/2), (—2V2, — 2/2), x- and : i 
axes (7 = 0, x = 0) 
(g) c (0, 0), f (2/3, — 2/3), (—2V3, 2/3), D (/6, —/6), (—v/6, /6), 


x- and y-axes 


= eee ee ae 

4. (a) , ao (b) m 36 1 
ge YOlase vy 8x? 

(c) eee ae (d) Ww WT 


6. All are hyperbolas. 
(a); ¢ GC, a3)ef G, —3), (4, —3), 8 Ge 
y+3=4+2(—- 1) 
(b) ¢ (1, —3),f (2), CL, —8) v C, O..41, =O) pee) 
ol + 2/3, —3 — 2/3), 1 — 23, —3 + ee eee 
(d) c(—4, —2), f (9, —2), (—17, —2), v , —2), (—9, —2), 
y+2=4¥7O04+49 


(¢) (81s, es Se (2 4 we oe av), 
(2 - 282.2 -5¥?), pany a2 

(ff) ec (1,1), f BG, 1, (-1,),00 + V2, 0, d — V2, 0, y = x, 
p= es a 

(:) c(—-2, —D, f(-2, -2 + W910 ee 
y +o, BUX + 2) 


9 2b 
8, (a) 5 (b) eae 
a 
16 5 25 13 
2 Ol x = eG (b) y= 4 e= 5 
Z 
10. (a) Veter + y? (b) |x +— 
x 2, 
¢) =-~" = 1 


11. vertical distances: 10 — 102 — 16, 100 — 1002 — 16, 
1000 — 10002 — 16, 10,000 — 10,0002 — 16 
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bx bxix bx 
12. (a) —— (b) y= - 4+ yy 
a’y; a’y a*y, 
2ax, +c 
= (a) 2by1 —d 
. (ax, + c)x 2ax, + Cx, 
oe Foy, — d a 
16. (a) circle; c (—3, —2),r = 5 
4/3 
(b) ellipse; c (—3, —4), f (—3 + V3, —3), directrices: x = —3 & eo 
3 
Bee MA. = 4, ma. = 2 
fc) parabola; f (—3, 2), directrix: y = 33, v (—3,p), axisx = -3 © 
4vV/5 
(d) hyperbola; c (—3, 4), f(—3 + VJ/5, 5), directrices: x = —3 + be : 
/5 3 
- aid , asymptotes: y —- 3 = + aE , v (—1, 4), (—5, 3), ta. = 4 
(e) parabola; f (—42, —4), directrix: x = —7#, v (—2, —4), axis: y = —3 
(f) hyperbola; c (—4, —6), f (—4 + 36/6, —6 + 3/6), 
(—4 — 3\/6, —6 — 3v/6), directrices: x + y = +3\/6 — 10,e = v2, 
asymptotes: x = —4,y = —6,v (—4 — 3V3, —6 — 31/3), 
Pa 3/3, —6 + 31/3); ta. = 6/6 
(g) circle; c (—1, 3),r = 8 
1 - ot 1 V15 
° ee —! —— os 5 2 = b) 
(h) hyperbola; c (3, —3), (3, 5 mal 5 directrices: y 5 10 
. “es: 
= \/5, asymptotes: y + = 45( - 9, 0(3. — 5 ce 
ta = V3 a 
. aia, V1 
[ay hyperbola; c (0,1), f ,1 + 10), directrices: y = 1 + a ? 
a 
a = asymptotes: y — 1 = +3x, v (0, 4), (0, —2), t.a. = 6 
V/2 2V2 
ie ellipse: c (0,1), f (0,1 2 2/2), directrices: y>= 1 = ail wl aa 
m4. = 6, ia = 2 
CHAPTER 4 
Section 1, pages 173-174 
1. (a) yes (b) yes (c) no (d) yes, 4 or any smaller number 
2. (a) 1 (b) 3 
(c) 1,3 (d) 1 
3. (a) “gy 12 (b) 3,3 
© $235 (es, 5 
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4. true 
BG) 5 (b) 4 () 4 (d) 4 (e) 1 
Section 2, pages 181-183 
i454 (b) 2 
ey 1 (d) 
2 fj 100 iy 5 (c) 87 
dG) 2S fe) <3 (f) 62 
3. (a) 85 5 
: : x a es 
(ey ba ee mee Sey 
(e) 1 x" 
5. (a) 4@Qn’ + 3n’ + 5n) On =3n +7) 
Ct = aE ne * - 
(ec) 30° + 3n’ + 5n + 3) 
6. (a) 24 £ 
() 2a? +2) ie 
3 6n 
se ee (b) 1 (d) 
8.(a) 4 2 
11. (a) 78 (b) 364 
Section 3, page 190 
igs (b) 4 (c) 6 (ys 2 
3. (a) 2 (b) 6 (c) 6 (d) 27 
Section 4, pages 197-198 
1 1 
1. (a) i x’ dx + s| x dx 
ol, : - 
(b) af Sax — | dx — 2 dx 
a Z Z 
2 2 2 
(c) a| Pa +2] cart | t dt 
3 3 3 
(d) | y eS uf ydy+ af dy 
= I; 1 : 
(e) [ tact] Sax +4] dx 
2 GES Ye @ ¥ ) 3 
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a) 5 (b) 3 (c) 14 (d) 28 
a) > py 2 os 

a) §,-4,-4 (b) 3 $2 

4,538 


Section 5, pages 206-209 


2. 


a 


12. 


Ee. 


14. 


8 4 1 


a 4 x 
(a) _ (b) 7.3.7“ 
(c) 38y"° +c ar 7 £f = P+‘ 
. 1 
(ec) 3Gx+1) +c = eal +c 
- tap 1 (b) 4 (c) 18 (d) 18 
) ¥ (f) 3(2V2 — 3v3) 
(g) —$V2 (h) 12 
(i) 3 (j) 36 ? 
(k) 3232 (1) 23 — x? + x — 21) 
r ar x° 
(m) 3 i a t (n) 4 
(o) bB-a (p) 2x(8x — 1) 
ar 1 ( 1 1 ) 
aa ee oe 


’ Lite. 
(c) (i) We have not yet encountered a function whose derivative 1s — . 
Xx 


(ii) If ab < 0 we are integrating on an interval on which the function is 
not continuous. 


. (0, 2) 
_(a) F(t) = 6 — 4¢ + 1, F’(—1) = 11, F’Q) = 17, F’() = 6%? — 4x + 1 


(b) 282 —2°4+¢ 


@) G@=xt 5 @Q=3 


ew - 142 
oa 
ae (b) 61° + 21, F(2) = 196 
-—-- 1 
(a) V1+ 2 (b) ee aE 
1 at 1 
(c) Oe 4204 1 (d) peed es t+ 
. is not integrable on [—1, 1]. 
x 
(a) 6 (b) 2? (cy. 5 (d) 4 es 
/ ] Ror . 
P14) = ae No extrema; p.i. at (0,0). Domain = (—«, ©). 
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Section 6, pages 218-219 


3 2 


1. @) S+5+xt+e (b) gx! ote 
€) 2f —? + +c (d) $y — 8y? -—3y +e 
2k = 1 (are 2 ec 
(2) -A@’ — xy? +e (h) V2Q+444+5+6¢ 
(Gi) 3s° + 35° +5) +c 


2 2 
(i) Fteforx>0,- +e forx < 0,02! + 


2. (a) cannot (bt) $x°7 42x27 +e () £0°74+7'4+c 
(d) yoy! + S8y" + 495% + 48 y + 249 + 
(e) cannot (f) cannot 
2 
@) Bi +2°-5-ax+e 
(h) zo(s° + 2s — 3) +e 
2 
(i) = ee 
(j) cannot (k) cannot 


3. fix) = 2 xe Sa 1 

4. f(y =x*+x-1 

5. f(x) = xt'+x7?+4+ 5x - 2 
6 


(a) 4 (b) 0 
ze: gi 
(c) ori (d) Vans 
(e) aan (f) 3 
(g) 0 (h) 12 (i) 22 (j) 42 


7. f(x) = 9x? + 10x + 2; f(x) = 3x3 + 5x? + 2x +1 
8. (a) eg) = x8? —3x+V72 () JOS Faeers i 
9. 28 


Section 7, pages 227-229 


Lak 4h" = 4,7: 23 
Oe ee oe (d) 0;P+=3,P- =} 


(a) § (b) 1 (c) Z (d) 4 


FO eee 
nln 


CHAP. 5} ANSWERS TO PROBLEMS 
2 4 
9. (a) [ @-xyax = %9 (b) 2| vody = 9% 
10. (a) 12 (b) 3 (c) “3 
11. (a) yes 
12. 2 
Pe 
13. 10 
5 a a b 
14, 25 f Vee -* | Vat — x2 dx, de, | Vb? — y? dy 
a = a 0 b sy 
15. 28 
Section 8, pages 235-237 
1. (a) 224 miles 
(b) min. speed 0 at tf = 0, s = 0; max. speed 75 at t = 25, s = 125 
2. (a) + hour, 74 miles (b) 285 miles 
3. (a) —32t + 640 (b) 4800 ft 
(c) 6400 ft at t = 20sec (d) 8000 ft 
(e) —640 ft/sec 
5. (a) to the right (positive direction on x-axis) 
(b) s = —2r? + 20t — 48 (c) 48 
(d) 68 fe) f= 4,6 
6. (a) v() = 32 — 2t-—1;yM =f -—f-t+1 
(b) 16 (c) 532 
7. (a) 32 miles by 20 miles (b) —22.5 miles per hour per hour 


(c) 23% hours 
9. (a) V = 17287 — 10f in.*; 172.87 min 


(b) r= a)ins _ in, 


10. (a) G, 4). G &) (b) approaches (1, 1) 
Seta) x(1) = vot, ye) = —ie (b) y= 55 —8 .?; parabola 
Uo? 
CHAPTER 5 
Section 1, pages 247-249 
2 7 
2x + 1 2x — 9 
S 5G = GD ee, es 
1 3 
ae 1 eee 
x 1x 
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2x 3(In x)” 
(g) P43 (h) aa 
1 so an 1 
Ww) xinx WW) x 45 2(x — 1) 
1 1 2(x — 1) 2x 
Ce a ee () = Ie te gee 
1 2x 
(m) - i a 
> (ay). pee (6) +2p (c) 49+ q+ 3r 
(d) ro cap Gy oat ay) (f) 6p + 3q 
4. (a) In(x)+c (b) Nnvx?+it+e 
@. Bix’ — te 4+ 2) +2 (d) 16° 4:3) 
— (x + 1) esate 
(e) 3 in |x — 6] +e (f) In “3 +c 
—| 
(g) x+In|x —1]/+c (h) eee +c 
x — | 
(i) $n x) +c 
5. (a) mo-extrema Of p.i. (b) no extrema or p.i. 
(c) abs. min (0, 0), p.i. at (41, In 2) 
(d) abs. min (42, 4 — 41n 4), no p.i. 
(e) no extrema, p.i. at (+2, 4 + 41n 4) 
6. fa) inZ (b) £1n6 (c) In35 
2 
(a) awe = 3 (e) gs * : (f) Ing 
2 
7. (a) Ing py () ¥ 
8. x. 4/3 
9. fay x-<4 (b) The integral does not exist. 
ii 2) 0°”. oe (c) O (d) —1 (e) 0 
Section 2, pages 255-257 
x—4 
2. The range of each must be the domain of the other. 
3: fa) ve (b) et +2 
(c) xe* + e& (d) —2xe = 
(ey be? nx ty =e 
x } 2 
aoe ee —3 
(g) ee (h) = 
(i) e {x = 1) (j) 51 — x) 
x x 
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(k) (6x — 4)e32* —42+5 (1) ager +e 
4. (a) ge" +c (b) er +e 
(c) x +e (d) Inje*7—e*| +e 
(e) Ein(4c +5) + ae 
Se4x 
= z°+22 
ee tC eS eae aes 
= 
ax+b 
@) exte oe re 
5. (a) In 2 (b) 3(e® — 1) 
(c) e* —e (d) e t*—e° 
6. (a) no extrema, no p.1. 
(b) abs. min at ¥ >— 1) 
e e 
1 2 
(c) abs. max at @ 1) >p.l. at (. a 
e (és 
= 
(d) rel. max at (2. _ > abs. min at (0, 0); p.i. at (2 + V2, — 
e ot v2 
(e) abs. max at (0, 1); p.i. at Ge , ey 
1 1 
pay 2 — 1) (b) 4(e"° — e’) ©) 
8. (a) 0 
10. (a) 1 (b) 4 (c) e (d) -1 
(e) 0 (f) 2 = 9 (ps2 
11. g(x) = Vx, f(x) = x2, domain g = [0, ©), domain f = (—~, «) 


a 


aa) -e" 


Section 3, page 262 
2. yes 


co mer ig 


3. g(x) = x + f(x), where f(x) is the step function defined in Problem 3, page 190. 


aia) = x° 
Section 4, pages 269-271 


aa) 4°" In 4 


In 4 
(c) In 10 
(e) a+ xIna) 


2x 


To an 
Gp 


(f) x2°(xIn2 + 2) 


696 


EP 


. (a) 


(g) e (1 — x) 


O22 ( a x) 
XG 


(k) 2x**(Inx + 1) 
(i 


. (a) —st+e 


In 7 
(cy) Ste 4 7y tcc 


(g) 


1/2 
o Hsien 


2L(x + 2)(x« —-— 4) 


ANSWERS TO PROBLEMS 


Ix —~ Fina 


() inde — 4) 


GY Gene) 


(b) 
(dj) =3 (1: 


In 3 


+o) 
x +3 x+2 x—4 


(b) x < —3,0r-—-2<2«< lox 2@ 


Inz 


2X, six 
5 


() Gx = ney 
(a) 1 (b) 


Section 5, pages 278-280 


i. 


ann Ww 


10. 


Gry Sar, 


ic} xy = c 
(ec) evy—e=cec 


. ay are 7 


(c) y= —3V x! — 7 
pos —et 


ey ae 


xv —ypae 


. (a) separable (b) 
(e) both 
(h) neither (1) 


(d) both 
(g) both 


. 1 — 2-”, or roughly = 
45 In 3 


ee years (roughly 70) 


—In2 
16 


(b) (Inx) in (In x) + ee) 
In(in x) + ') 


x 


(d) an x)"*( 
(oc) 2 


oe 
(by f= vi a > +e 
(dd) y +1 = e+ IF 
Qo Mea y= 2 
(b) y = 5e(3/2)x 
(@) x°+>)° = 2 


(c) neither 
(f) separable 
neither 
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11. 10’e'"**(roughly 1.26 x 10") bacteria 
ee. =~ sec (roughly 0.2 sec) 
13. 4 
14.y=ce "+ 2 
a 
oo } = _ = 3 
CHAPTER 6 
Section 1, pages 288-289 
So ae 
aa 0-1 (b) aa 
oe: : 
(Cc) —1,0 iS = 
ae ND. yD 
an (f) , 
2 2 2 2 
2. O COs sin 
I = =e 
II - a 
Ill _ - 
IV +- — 
3. (a) ¢t = no, n any integer (b) 5 (2n + 1) 
(c) w(2n + 34) (d) 2nr 
(e) r(2n + 3) (f) r(2n + 1) 
(g) no values of t (h) no values of t 
4. domain: R; range: [—1, 1] 
5. (a) +1 (b) O () 0 (d) +1 
V6 + V2 V6 — V2 
6. (a) “SS i 
6 — 2 V6+ v2 
a ee oma 
4 4 
8. (a) cos 2a = cosa — sin’a (6b) sin Za = 2 sin a cos a 
cosa = cos’ © — sin? 4 (d) sina = 2sin—cos-= 
Zz 2 Z zZ 
9. costa = 3(1 + cos 2a), sin’a = (1 — cos 2a) 
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Section 2, pages 299-300 


1. (a) 1 (b) 1 
(c)--1 (de 2 
1 
ey 68 ee 
T 
(g) 1 (h) © 
a ee: Gj) 3 
2. (a) 2x cos(x? + 3) (b) —e*sin e* 
(c) cos 2t (i-D 
(e) —cos x sin(sin x) (f) — e 
sin x 
oe ee 5 1 
(so) 23x Si xX Coe (h) ——— 
cos?x 
1 . ae 
A eee ce (Gj) e (cosx — sin x) 
3. (a) 4+sin7x +c (b) 4sin2x — 4cos3x +c 
(c) sin(e*)+ ¢ (d) —cos(x + a) +c 
(e) —e Sint 4+ © (f) sin(sinr) + ¢ 
(g) —In|cosx| + ¢ (h) —Zcos’x + ¢ 
(i) 4+sin’x + c¢ Gj) x+e 
Cie x 
4. (a) ————— +c, ifn # —1; —Inlcosx| + c,ifn = —1 
n+ 1 
- n+l 
(b) a on ie ee 1 i 
n+ 1 
3 7 5 
() -cosx$——+ @ =a 
3 i P) 
5. 3(1 + cos 2x), $x + 4 sin 2x +c 
6. 4(1 — cos 2x), 4x — gsin2x +c 
tt) 2 = —cos(e*) +e 
(b) siny = —cosx +c 
(c) sinx + $cosy?=c 
Stayt (b) 0 2 ee 3 (e) 5 eS 
Section 3, pages 309-311 
1. even: cos, sec; odd: sin,csc, tan,cot 
3. (a) 2sec*x tan x (b) 4x sec?(2x* — 1) 
(c) tan x (d) cosy 
(e) 2sec*t tant (f) —3x°csc(x®? — 1)cot(x® — 1) 
1 
ey ® Yee 
5. (a) #In|sec 5x| +c (b) In|sinx| + c¢ 


(c) tan(e?) +c (d) tanx -—x+e 
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4 
(e) ttan’x +c (f) = +c 
(g) —2esc/x+e (h) —4cot’x — cotx +c 
6. Inlsec x + tan x| +c 
7. Injsec x| + ¢ 
9. (a) 1 (b) 4 
10. {a) 2 (b) (lines are perpendicular) 


ee 3\"” Rea 
() -2 - (—-— 
Ba) (z-) 


11. The functions, domains, and ranges are as follows: 
sin: R,[—1, 1] 
cos: R, [—1, 1] 
a: x > 5 (2n +1), R 


cot: x = n7, R 
Sec: x #5 Qn + Pei Sl 


csc: x # nt, |x| > 1 (nis any integer) 


12. (a) 1 (b) 1 
13. (a) 4 (b) ot (c) 3 
(d) 2 (ey: 1 (f) 0 
Section 4, pages 321-323 
ud T 
| 3 (a) 2 (b) ; : 
(c) 3 (d) -% 
(e) 3 (f) 6 
(g) 6 (h) : 
(i) a+ 2nr, nan integer such that — zs <a+2nr < ; 
— Si 
(j) 7 (k) _ 
(1) ; — (a + 2nm), n an integer such thatO <a-+ 2nm7 <7 
(m) -—1 
(n) 2x + 2n7, nan integer such that — : < 2x Zar < 
Ll T 
0) (p) 3 
2. (a) os (b STE eee 
V1 — x! 2/x(1 + x) 
1 
Se (d) 1 
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yi 
ey. =i — 
jx|((1 + x°)V/x?2 + 2 
6x” 
(g) 0  qeaee 
1 
— 
3 ee a” 
3. The domain and range for each function is as follows: 
@) || <1 ieee (o) x 20,0 p= 
C) 220. (d) RO<y<m 
) BVA y¥ =e Co ais Ds Gu 
1 37 T 
(g) ahs (h) ae oe 
iy x > @, -o G) |x}=1,y=0 
4. (a) 8 arctan = +c (b) arcsin a) +c 
(c) garctany? +c (d) arctan(x + 1)+c 
(e) arcsin(y —1)+c (f) sarcsin(x + 1)? + ¢ 
(g) arcsecx +c (h) 4arctan(x? + 1) +c 
vs vi 
V3 x 
(c) eee (d) 3 
(e) arctan x (f) 7 
8. (b) R, (0,7) 
9. arctan ¢ 


Section 5, page 326 
1. (c) and (f) are polynomials 
3. 4) Fa,y) =o - bee 
(6) Fi) = ae 
co raw = Fa = 
GD Jay) = eee — 1 =4 
(e) F(x, y) = y — Un 5)x 
G) Fa.) =O = ay = 


Section 6, pages 334-335 


2. fa) 10-7 (b) —4 
(c) 6 Y ——F. 
(e) 2(a? + 5’) (f) —6-+ i29 


@) -#+i% (h) —$8 + 188 
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=e 2ab 

. eo - + = iia . ae | wee 4A 

(i) (a2 + B) = ae 3@ +B) (j) ae 
(k) 3+ 74 (1) 39 + i4 

4. (a) 2+ 13 (b) 5 — i4 (c) 7-i (d) 22-417 
(e) 8+i12 (ff) —7 (g) —2i 

Section 7, pages 343-344 

1. (a) V2 a (b) v2 ae 
(c) _ (d) a 
(e) Se” (f) ~— 

(g) 5 (h) 

2. (a) ee (b) ee’ | 
(c) — (d) ee 
(e) ate (f) ere 3 

3. (a) (-1,0) | ay (V2,V2) _ : 
(c) (-Vv2, —V2) (d) (V2 — 1, V2) 

(e) G,5) ee, 2), (3, —2) 
nile" 

eK = 01,2 ee” k= 0,1 
ili os = 012 (ay ce -kiO pt, 2, 3 
(e) V2 2 Exp : (asa A: ae) = 0,1, 2, 3,4 
a I a. 43 2a) inky > 

g 9 

Section 8, pages 351-353 

ee y = ce °)* (b) y = ce 
meee = ce °* + coe” (d) y = e(k, cos 2x + ke sin 2x) 
mee = (cx + ¢)e™ oy eer oe” 
ey = ¢) + coe” (h) y = ky cos 2x 4+ ke sin 2x 
Gi) y = ce + coe 3 Gj) y= ce 182 
(kk) y= a+ ce 

a a 
— y= ogee (x: COS be x + kesin > x 
(m) y = e-™(kycosx + kesinx) (nm) y = (cx + cic 

2. (a) y = (cix + ca)e* (b) y = (x + 2)e* 

3. (a) y= esx + Je-* (b) oui (3x 4. Dee 
in y= 5 — 2e-** (dj. vy = Se * cos x 
Bw a2 (dina ty, BY 5 YH) 

= ¢ cos ~~ x F 5 

81,5 


Bay y= cie™* + cre™ (b) A= 2p = 
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Section 
+ i 
(b) 


(b) 
5. (a) 


6.16) 


7. (a) 
(b) 
(c) 
(d) 


(e) 
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CHAPTER 7 
1, pages 361-362 


xsinx + cosx +e 
e 3 Fas +c 


xarctanx — $In(1 + x’) +c 
x? 


2 
eee zp xsin Ix + cos 7x +c 


7 343 
xInx x’ 
ee 
—(x? — 13x + 2)cosx + (3x? — 13)sinx + c¢ 
zae*"sin 2x — e**cos 2x + € 


biases tan x + In|sec x + tan x|) + ¢ 
308 + Iin(& + 1) — x3 + $x? —Gx+e 


x In(’ + 2) — 2x + 2/2 arctan ee +e 


a az b az. 
2c eee ee sin bx + ¢ 


—h alas 
rare 2 * cos bx + ee sin bx + ¢ 
5x + Zsin2x +c (b) 3x —Zsin2x +c 


(S25 (Injax + b|)” — nf dnjax ae 3 ie dx +c 


e*(x° — 5x4 + 20x? — 60x? + 120x — 120) + ¢ 
— Zz sin’x cosx — 2sinxcosx + 2x +c 

z sin = — 5 sin’ 5x as c 

)’ + 30(In|4])* — 120(In|A})° 


+ 360(In|A])” — 720 In|A|] + 720x + c, where A = 


2 (f) —2e+6 


Section 2, pages 372-374 


1. (a) 


sin 2x sin 4x 


g cos 3x — z;c0s7x +c (b) 


4 8 
sin2x  sin4x sin3z  sinliz 
ee. : Soe Fee 
= = aye cos(r a a = 
Z tr — 3 r+ 3 
cOSmy cos 3my sin5x sin 7x 
Qe a. (8) Fo 14 


cos 10w cos 24w 


x +7 


3 
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2. ta) 
(b) 


3. (a) 
(b) 
4. (a) 
(b) 
(c) 
(d) 
(e) 
(f) 
(g) 
(h) 


(i) 
(j) 
5. (a) 
(b) 


—cos@ + $cos*? +c 
= zs cos 30 + ¢ 


4cos 2x + 4 cos? ay — z/5 cos® - +c 
=79 sin* 2x cos 2x — 2 cos 2x +7 0os 2x +c 
5 sin 22 — <sin'2x +c 
ar 42 sin 2x + Bos! x sin x + ¢ 
-. ers 1 sin 12x + c¢ 
gz cos? x — ge cost? ses 
39 — ae +c 
= = 82x — F ee + 2 sin!!?x + ¢ 
— rhe sin’6x + 545 sin 6x + xx + gg sinl2x +c 
= sin'y — 2 iat 4 samy c 
= ex sin 4y — gg sin’2y + 
16 
3, cos!!2x — 2cos’?x + ¢ 
46+ 4:sin 20+ c¢ 
4cos@sind + 40+¢ 


6. $tan’x + c, $sec’x +c 


7. (a) 
(c) 
(d) 
(e) 
(g) 
(h) 
(i) 
(j) 

8. (a) 
(b) 


10. (b} 


11. (a) 
(b) 
(d) 
(f) 


4tan’x —tanx+xt+ec (b) $tan*4y + ZIn|cos 4y| + ¢ 
3 tan @(sec?@ + 2) + ¢ 

4a(tan 2x sec 2x + In|tan 2x + sec 2x!) ae 

7 tan’x + 3 tan’x +c @) 5 sec’x — 3 sec’x + ¢ 

+ sec’x tan x — dsecxtanx — dI1n|secx + tanx| +c 

% tan’x + % tan’x +c 

37, tan!!?x + 4 tan’?x + 2tan’?x + ¢ 

—In|csc x + cot x] + cosx + ¢ 


1 = 2 
Jot xa = ze 7 cot” tx — | cot” *xdx 
i 


—;4, cot!30 + $ cot?3@ + 4 In|sin 36] + c 
—+ cscty cot y _ is csc’y cot y _ fs cot y +c 


—} csc*@ cot 0 — $csc cot 86 — #In|csc 6 + cot 6| +c 


4sin3x +c (c) po ee es 


—zcot'x — $cot?x + ¢ (e) gcsc®2y — zh cscb2y + ¢ 
—icsc’g cot @ + ¥csc Pcot d + Fln|csc gd + oss ¢| +c 


Section 3, pages 382-384 


Zz 


1 Ss 
1. = 1 (aver — x2 — a’ arccos a he 


2. (a) x = {al cot 6, Va? + x? = |al csc 6 
(b) 0<@0<7 


3. —In |csc 6 + cot 6] +c 


“[-Pueey 


2 
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5 ta ee 3 arcsec (3 ) +e 


(b) dyin/x? — 1 — BxV x? — 1 +. 8 Inix + V2 = i de 

C} S306 Sa) nee 

(d) ee — 4 — pyx?V x2 — 4 — xV x2 — 4 — 2Inlx + V2 — 4 te 
—" 

(ce) 41n cn Fe 
x 

(f) dxQ? 4 4)" + Bev £44 bine eee 

Cee ae 

(h) =v Aa x) oo 

(i) BxV2 — a — $a aig 


Gj) ta ae 
250 a 5 
6.a b — arcsin <= —(a—- h)V 2ah — hf? 
7. (a) In3 (b) 1 © 5 
a Sa 37 1a 7 + 2V10 
(d) ak (e) 3 (f) 3In 9 
5 Pee pei 3.” 
8. (a) = arctan 5 +o¢ (b) garctan 5 +c 
(c) arctan ae +e (d) 4arctan(x + 3)+c 
x 3 oe 
(e) 1262 + 3) Sa ae arctan ————— =e 
x—2 F — 2 
(f) oe =z arctan = = +e 
x x ; 
(g) 3602 + 92 “- 216(x2 + 9) T a8 g arctan = +c 
ese : + Sarctan(x + 1) + ¢ 


A(x? + 2x +2)? 9 802 + 2x + 2) 
Section 4, pages 394-396 


io) Ota 
© x-sa4t+o5 @ —34+555 
(e) be 74 5- (f) | a 

2. (@) nft=3] +6 (6) nZt De 4 
(c) 2 in 5 oa 
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(d) 


3. (a) 
(c) 
(e) 
(g) 


(h) 


4. (a) 


(c) 
(e) 


(f) 
7. (a) 
(c) 


(d) 


(e) 
(f) 
(g) 


(h) 
(i) 
(j) 
(k) 
(m) 


(n) 


In |(x — 4)V x? + 2| — ¥? arctan at +e 
—1 2 —3 3 
mea yi (b) aes al 
1 2 5 3 
a ( — 1): (d) pass 
—| Ps 1 4 2x — 3 
eee 1 (f) ea pane ria) 
a 2 i day 
eee Xt 3 Oe OP 
4 5 ax | er 
2x — 1 (x2 + 1) x2 + 1 
meet 9)" en 
In = ee (b) al ae, +c 
Injx — 1|-— +e ey Sin lx) + 3x Sess 
In[(x + 1)? + 4)] — 3aretan> + ¢ 
2 1 

i ce. ae 
| ~ ee + ee eer", 
Pe 1 eS : - (x — 1) 
Zin (x’ +1) —In|2x + 1| +c 


3in |x — 2| — 2in|x— 3|-——5 +e 


arctan(x + 1)+c 
In(x” + 2x + 2) — arctan(x + 1)+c¢ 


4 Jn |tanx — 3| — $In|tanx — 1| +c 
2secy + 1 

alice ea en 

ia secy + 2 owe 


V3 V3(2y + 1 
y — $ln(y’ +y +1) +P arctan Pt” + 


J/2+2+1 i 5V3 arctan = 5 a 


In 


iz + 4) 
ie -— Ne? jx + 1) 
In BeetT | +e dq) x«- tin jaoy ef 
2 
gia (2x — 1) + ———— wan — In(x + 1) — arctanx +c 
2x + 1 Fy 3(2x + 1) 


S61 = x -F 5) 


12/19 baie ch By) 
+- 6859 arctan es eEeS +c 


38(x2 + x + 5) 
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Section 5, pages 402-403 
3. (a) 20 + x) °- 80 4 x)? +. 
bee 4 x Se 
(b) ee ee 1) +c 


5 
JA 3 =. 
(c) secx —tanx+x+ec (d) a eE eee tte +c 
2 sin x + cos x 
V/3 \/3 sin t ; 
Be. 1% 13/15 
(e) 3 arctan a ee ee (f) <73d + x) +c 


(g) a = 0 Fe a = ingy!® £ 1)° cece 

(h) 4sin’x +c 

(Gi) ade (3(5x + 3)? — a05x + 3)"? + 18x 4 3)"”) 4 c 
= 1 


Gq) ——. + ——— + 
31 + Vx) 206 + Wx) 3 
Vi+e—-1 
(k) 20 +e")? + ) jae 
: : vitezi-— 
(c) 05 (d) 44+ 2in3 


Supplementary Problems, pages 403-404 
1. 4(2x” — 1)sin 2x + 5x cos 2x + ¢ 


Zz } arctan +c 


3/2 


— A+ 1)" +e 


2 


7. e+ ie ee 


3 Z 3 2 
t t t t 


9: (va =e t arcsin*) Sk 
10. 3eV4 + x2 + 4In[x + V4 + x21) + ¢ 


11. ¢ tantx — 4 tan’x + In |sec x| +c 


12. —s5 cos!!2x a i5 cos2y — 30 cos>2x + Cc 
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13: 5 Vax +1+2injdx + V4e41| +c 


14. 3(e7 +e“) +c 
15. 2x!? — 4x14 4+ 4InG!4+4+ 1) 4+.c¢ 
16. —ZIn|y| + §lnly? — 4] +c 
17. S5e“(sint + 7cosf) + ¢ 
18.x-—Ind+e)+c 
19. —Scot(2x + 1) + $cscQ2x +1) +c 
20. Inlx + V4 + x2] +c 
21. Inj2 + Inx| +c 
2 
22. Z arctan a +c 
23. $in(e* + e-*) +c 
24. y[diny)? — 2Iny+ 2] +c 
25. —cosx + $cos*x — ¢cos’x + c¢ 


3 3 
26. (2 + «)mx-(Z+2) +0 


27. ¢ln|7 + 4secz| +c 
28. $x — 7% InQe* + 1)+¢ 


n+l oe 


x 
eg Gti ° 
30. 3dn x)? + ¢ | 

31. $sec*y tany + 2sec ytany + $In|tany + secy| + 
x+1 


x—-—1 


Ya 2 


+c 


meee inix — 1| — 2x + In 


33. S$siny + 7gsin9y +c 

34. —;4, cos 8x — gcos2x + ¢ 

4 

x + : — 

36. arctan(x + 2) +c 

— z 

oS dy | * 

38. et(x® — 5x* + 20x? — 60x? + 120x — 120) + ¢ 

oe 4a — x)V 2x: — x2 4 arcsin(l = x)] +c 
= 

ei S 


Al? 
41. 2arctan as +c 


35. 41n 


37. 


40. 
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3 
42. = — 4x + 8 arctan +c 
43, $x — 22 In]3x + 5| +c 
44. ese + ¢ 
- 
i 
46. g1n|1 + 4sinx| +c 
47. 4x" arccos x = Ex'V/1 — x3 — 2V1 — ee 
48. In\(¢ — 5)?(¢ + 2)3| +c 


49, 15(3x9/5 — 3x8/5 4 16,7/5 _ 82y6/5 4 256, 
— 256x4/5 + 4096 3/5 _ 8192x2/5 4+ 65,536x"/5 
— 262,144 In|x!5 + 1|) + ¢ 


45. 


5 4 3 2 


6 
x x x . x 
a ee a ee 


CHAPTER 8 
Section 1, pages 411-412 
1. (a) 4 (b) 0 (c) In2 
oo (e) 2 (f) -1+%In5 
n— 1 T 


4. 5 sec, 80 ft/sec 
5. v(0) = 90, v(1) = 96(max), v(4) = 42, average = 83, number of words = 19,920 
6. 182 ft, 94 ft/sec 


7. (a) = (b) 3V2 (c) 0,2 
oe 
(d) In (2 =) (e) arccos = 
9. x(t) = ee = 


Section 2, pages 420-422 
l. (a) 3 (b) = (c) 3227-1) @ 2AW2-1) 


3. (a) Ts = $3 = 1.34375, | =4 (b) T% = 12 = 4.75, i = 14 


(Cc) TM = 12, | = 12 (d) 7, = 444 = 0.705, | = 2 

(ce) T, = 7356 = 0.697024 (f) T, = 3323 = 0.782794 

(g) Ts = 0.742984 (h) 7, = 45; = 0.608108 

(i) T, = —4223 = -—0.565588 (Gj) M= 3m + 16V2 + 12 83551 


24 
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5. (a) 
(c) 
(e) 

6. (a) 
(b) 
(c) 
(d) 


(e) 


Section 


16, 151, 15,001 (b) 21, 201, 20,001 

no error (d) 5, 41, 4083 

961, 9601, 960,001 (f) 5, 41, 4083 

Tip = 0.835023, T1900 = 0.835642, Ti000 = 0.835646 

Typ = 0.784981, Ti00 = 0.785394, Tioo09 = 0.785396 

Ts = 1.11499, Tso = 1.11148, T5009 = 1.11145 

See 1.7054, 7, = 1.83551, 73 = 1.84784, Tig = 1.85091, 
Tipp = 1.85191 

Tip = 0.746211, T1009 = 0.746818, Tino = 0.746822 

3, pages 431-432 


M, = 8% = 1.32812, | 4 


1. (a) = 
(b) M, = 4, | an tf 
(c) M; = 0.648146, | = 2 i 
(d) M, = 1.36616, | == In A = 1:3363 
(e) My, = 2.79961 @)  Bfy = 344159 
(g) My, = 0.748747 (h) M, = 1.10867 
eee 24 = 1.34375 i Ty =— 4 
meee, = 0.705 (d) 7, = 1.42809 
(e) Ts, = 2.79306 (f) Ts = 3.14159 
(zg) Ts, = 0.742984 (h) 7, = 1.11699 
eee 4 — 1.33333 (ob) $1 = 4 
(c) S4, = 0.671481 i) Se = 1.39162 
(e) S4 = 2.79734 (f) S4 = 4.18879 
(g) Ss = 0.746855 i) se £11136 
8. (a) 301 (Trap), 214 (Midpt), 31 (Simpson) 
(b) 321 (Trap), 164 (Midpt), 29 (Simpson) 
Section 4, pages 442-443 
E 32 
2. 4r 
3. 20 
4. Sra’ 
5. Sab" 
387 1287 867 
is Oe © 45 
Tt 167 
(d) 74 (e) cs 


710 ANSWERS TO PROBLEMS | [cuap. 8 


157 Or 
8. (a) = (b) > 
Or 5127 207 
9. (a) > (b) =e (c) xo 
(d) 2rin2 (e) 5 
An 
10. = 
h+a 
H.-& in| xVa? — (x — h) dx 
h—a 
(b) 2n'a’h 


12. ra’h (2 7 7 


9 h 2 
13. mah @ ea ) 


Section 5, pages 450-452 
1. 10,000 ft-lb 

2. 60 ft-lb 

3. 10 ft-lb 


d—b 
Me M, Ms M, 
8, (a) Gm e. x? = -) (b) -cm| ee r= x? = = dx 


Section 6, pages 459-460 

1. a, b, c are integrable 

2. a, b, d are integrable 

3. (a) 16 (b) 4 (c) 10 
5. both limits are 0 

Section 7, pages 470-472 


1. (a) improper, convergent: 9~/2 (b) improper, convergent: 2 


T 


(c) improper, divergent (d) proper: 


CHAP. 9] 


ANSWERS TO PROBLEMS 


(e) improper, divergent (f) 
2. (a) improper, convergent: $ (b) 
(c) improper, divergent (d) 
(e) improper, divergent (f) 
(g) improper, convergent: 5 (h) 
4. convergent for s > 1, divergent otherwise 
5. (a) improper, convergent: 6 (b) 
(c) proper, (d) 
(e) improper, convergent: 4 (f) 
(g) improper, divergent (h) 
(i) improper, divergent (j) 
7. (a) improper, convergent (b) 
(c) improper, convergent (d) 
(e) proper (f) 
foe) = 1, F'(1) = ; 
CHAPTER 
Section 1, pages 480-481 
om) 1 (b) 1 
(d) 0 (e) e 
2. (a) 1 (b) 
(c) « (d) 
(e) 0 (f) 
3. (a) no limit (b) 
(c) 1 (d) 
4.0 
mee 1. 0, 3, 2, 5, 4, 7,6, 9, 8 
"if nis even 
(Cc) Sp = 
n, if nis odd 
Section 2, pages 489-490 
Poa) (b) 
(c) does not converge (d) 
(e) does not converge (f) 
(g) does not converge (h) 
2. converges to 2 
6 1 
7. Gm — lim ap 
ota) 1 (b) 4 


proper 

improper, divergent 
improper, convergent: 
improper, convergent: 
improper, convergent: 


improper, convergent: 3(/4 — 1) 
improper, divergent 
improper, convergent: 
improper, divergent 
improper, convergent: 


Zz 


1 
improper, convergent 
improper, divergent 
improper, divergent 


9 
cy OD 
a) 4 
1 
does not converge 
0 
1 
2 
i 
4 
alc 
2 
does not converge 
7 
= 


(c) does not converge 


712 ANSWERS TO PROBLEMS | [cuap. 9 


Section 3, pages 497-498 


1. (b), (e), (g), (h), G), G) are covergent 
2. both are convergent 


Section 4, page 502 

1. (a), (b), (d), (e), (f), (h) are convergent 

3. (b), (e), (f), (h) are convergent 

5. (a) 4, 77 10;20 (b) 9, 99, 999,10° —1 (c) 20, 200, 2000, 2 x 10° 


Section 5, pages 510-511 


1. (a) conditionally convergent (b) absolutely convergent 
(c) conditionally convergent (d) absolutely convergent 
(e) divergent (f) absolutely convergent 
(g) divergent (h) absolutely convergent 
3. (a) divergent (b) conditionally convergent 
(c) absolutely convergent (d) absolutely convergent 


5. (b) no limit 


Section 6, pages 518-519 


iia) 2 (b) 1 icy-4 (d) 1 
iced Ors (g) 1 (h) 3 
2, 1) {= 2e2) (ob) 15, i 
os 1-4 (hy: {=3.9 
3. false 
ee (b) x (c) 1 
(d) 0 (e) 5 (f) 2 
a. G) {1,.9) (b) (—, w) ce ¢5, +4) 
(d) x=1 fe) -{—6, 4) (f) (—2, «) 


Section 7, pages 526-528 
1. (a) r = 1, domain = [—1, 1] 


ra) t—1 


(b) BS tF= 


jet 
(ej {1,2 
2. (a) +r =.2,demam = 7,4) 


00 i—l 
o) P&P a2 


(c) (0,4) 
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11. 


: f@) = 1. domain = [1, 3) 
(b) D> Vie -2)"r=1 
2—1 
(c) (, 3) 
. (a) r = 1, domain = (0, 2) 
ix — 1)" , 
(b) SS Ee 
2 Vit+1 
(c) (0, 2) 
(—~x«, «) for both/ and g 
0,1, 1, 0 
(b) true 


Section 8, pages 538-540 


1. 


(a) 


(b) 
(c) 
(d) 


(e) 


. (a) 


(b) 


. (a) 
. (a) 


(b) 
(c) 
(d) 


ta) 


» (a) 
+4. 


2 
fo=3,.T.=$-34h-$-B+ 5 : 
et. fo = | — 7, SS XO 
ie 2 8 4 2 — 1) + ae - 1 
Mm = 2+ 4 — 3), % = 24+ 46 —3) — 4 — 3» 
Tz; = 2+ 4(x — 3) — gg — 3)” + sta(x — 3) 

/2 /2 ve | 4 
ee gg ap 
/2 <a ") et =) 
oe eee ee ee 
Rd Z 4 4 4 
A 
es [= neven 
ax) = oS ia where k = 4 
= 
oD - beau 
n (=47 he — 1 
— — Sc aE ae ig = 
T(x) = 0, Tr(x) du n>1 
x” x 
litxwiltrxers (b) li,l- > 


T,=94+7«%—2,%=9+7% —-24+@-—2P% =T; 
fe = 8 to = Oe ee 2 Sx? + 2X 

7 = 2 + x + Oe = 24 x + tH 

Pe G0 1) ae — ly 2 = 3 oe DS a 


4-2 — D+ ae — 1? — AO — 1? 


B+ 11(x — 2) + 12 — 2? + 6% — 2)* = 6x? — 24%? + 35x — 19 
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CHAPTER 10 
Section 1, pages 548-550 
1. (a), (b), (c) are parabolas (d) straight line 
(e) cubic (f) semicircle 
(g) line segment (h) ellipse 
2. (a) straight line (b) parabola 
(c) line segment (d) hyperbola 
3) JO = 2 4 (b) f(x) = e” 
(c) fx) = 3V4— 2 d) f(x) = Inx 
4. (a) y—x=0 (b) 3" = Ss 
ie) x? — 7" = 4.x SO 
ies 2 (b) —32 (cy 1 (d) 5 
8. (a) 0 (b) —1 (c) 3 (d) —1 
= dx dy = x'y! a pix! 
9, (a) i = a etc., then — = = ples 
(b) § 
Section 2, pages 558-560 
3/2 
1. (a) ar(10!" = = ) (b) 24 
() fp" — 1) @ = 
2-3 
3/2 
3 @) fy (io — 8 ) (b) a 
(c) 4(V2 + Intl + V2) Ca ; 
ve 
3. tb) (c) | V4y? — 8y + S5dy 
0 
6. 42 
7. (c) 9/2, 22 
Section 3, pages 567-569 
Ea f= 2, 4) ie 2-3) jo = (d) (45, 35) 


(d) (5,4) (e) (8, 2) 
3. (b) V29, V2, 35 


4. true 
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6. (b) (V3, 8) 
7. (b) 6V3+4+ 4,1) 
9. (a) true (b) true (c) false 


Section 4, pages 576-577 


1. For each value of to, P(to) and the coordinates of dP(fo) are as follows: 
(a) —1:(—2,1), (1, — 2); 0: (—1, 0), C1, 0); 2: C1, 4), (1, 4) 
(b) —1: (2, —2), (—2, 1); 0: (1, —1), (0, 1); 1: (2, 0), (2, 1) 
eee —1. 0), (1, 0); 1: (0; 12, Gs:3); 2: C, 8), G12) 
feat. ©). (1, 1): In 2: 2, In 2), 2, 1) 
(ec) 0: (3,0), (0, 2); ; : (8V2, V2), (-3V2, V2) 


(f) — & a, ii), (—2, ay 0: (0, 0), (0, 0); e: (4, 8), (4, 12) 
(g) =it 2, 2); (1, ay; 0: (=0; 4), (i. 2)5 ,: (0, 6), (1, 2) 
2. The coordinates of dP(f) are as follows: 
(a) (2t, 32°) (bt) GE — ec), He +e) 
(c) (22, 3(3¢ + 1)'”) (dd) (2¢+1,°+ 2A 
3. Coordinates of dP(@) are (a(1 — cos 6), asin 0) 


6. P(to), the coordinates of dP(to), and equations of the tangent lines are as 
follows: 
(a) (6,3), 4,1),x -4vy+7=0 
meee. 1-0, 1), x = 1 
(c) (2,1n 2), (2,1),x —2y —2+1n4=0 
(d) (0,0), dP, and a unique tangent do not exist at fo = 0. 
a y = 3x 
(b) At t = 2 the vector space forms the straight line y = $x. At ¢ = O the 
vector space consists of the zero vector at the point (0, 0). 


8. (a) y = xf"(@) — o'@ +f/@ 
(b) dP(@) = (1./'@)pwi ¥ —f@ = f'@& — a) 


Section 5, pages 588-590 


2. (a) parabola: y = x? — x 
(b) v(t) = (1, 2¢ — 1)Pw, 4 
P(0) = (0, 0), vO) = (1, —1)pq@, |v(0)| = V2 
PQ) = 0,0), v0) = 0, Dew, |v] = V2, 
P(2) = (2,2), v2) = 1, 3)pq, — {v(2)| = V/10 
(c) a(t) = (0, 2)pq, perpendicular when t = 4 
2 


(d) vi | A/T Bp 4 di 


0 
WI 
@) #10 + + “mn ( > —— 
eee. tepes 
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3. (a) P(t) = (800r, —1627), v(t) = (800, —324)pw, a(t) = (0, —32) pq 
(b) 25sec 
2 
peat 3 
(c) parabola: y = aa 000 


(d) 32 | 625 + dt 
0 
4. (b) v(t) = (2t — 4, 2t — 4)pw, |v(D| = 2V2|r — 21; 
lv(0)| = |v(4)| = 4V2 (max), |v(2)| = O(min) 
(c) straight line and point (d) 8/2 
3; (b) v(t) z= (7t 3t7) Pay ae 
P(—2) = (4, —8), v(—2) = (—4, 12)p_»), |v(—2)] = 4V/10 


P(0) = (0,0), vO) = (0, 0)p., lv(0)| = 0 
PQ) =(1,1), vd) = (,3)pq, lv(1)| = V13 
P(2) = (4,8), —-v(2) = (4,12)p@, lv(2)| = 4V'10 


(c) a(t) = (2, 6f)pw, a(—2) = (2, —12)p(_»), 
a(0) = (2, 0)p@, a1) = (2, 6)pqay, a(2) = (2, 12)pQ) 


x y 
6 (a) wl Bane 
— 78 

k 


Section 6, pages 600-601 
t. (a) 292 4 N70, 0 (c) x +y ~4y=0 


25a) 4, 2 V3 2a 
(c) about x-axis 


5 
a" = ae 
6. (a) circle - (b) straight line 
8 7 Sa 

oe) Wats S=4 (d) y= es? 

(e) y? = 2x +1 (f) ‘x? +. pare 25 

(g) y = 3x (h) (& — V3)? + Qy? = 4 


Section 7, pages 611-613 
211 21 


i a o) = () =*+9 (d) 4 
147° 
2. (a) 4 (b) 16 (c) ex 
d) +s @} 
oe (o). 2 (c) 25m 


cHAP. 11] ANSWERS TO PROBLEMS 


(@) = oe 
1 r/2 dO 
=) 2 = + 2cosé+ cos?6 
i 2 (d) y 
6.0+ 3V3 
7. 2a 
fh isv3 — =x 
9. 24r, 167 + 9V3, 82 — 9V3 
10. 8a 
995 
of. 5 
12. (a) 4 
13. (a) 4 (b) 8 
CHAPTER 11 
Section 1, pages 622-623 
eee oy =x + 2e* + c (b) y 
(c) y = tani’ + 3x + 0c) (d) y 
1 2 
r= ae ee See 
(g) y = 90nx)? +c (h) y 
(i) y = c,cos4x + ce sin 4x S) ee 
me y= ce + ce? qd) y 
(m) y = cye** + cre” (n) y 
(0) y = e~**(c, cos 5x + cp sin 5x) 
ay + oy — gx +e * se 
a x” =" oo ees 
Se y= @) ey 
(u) +Inj4¢ + 3] + $In[3x + 4| =c 
(vy) y= arcseeKe*) 
(w) y =e + (x) 
(y) tany = 3x — $sin2x +c GG) ¥ 
2. (a) y = 5e” (b) y 
(ya yp? = 25 is 
eae Ze" | Se (ff) y 
(g) y = Inj3x + 6 (h) y 


3x 
ae. 
4 
ae 
Z 


(f) 4r 


= 1 — 2x 


2° + 5x+Inx+ec 
ce(t/2) x? ee 1 


= 2xInx —2x+c 


4x —4x 
cje*” + coe 
ce 2* + coe °* 
oe ae a8 ala 


CX 
Ge + -coe7 


a 
EG a 


2 3 2 
—-3¢ 4x +extes 
x 
4 
x ee = 2 


—tegt? + vot + So 
e?*(5 cos 3x — 8 sin 3x) 
(—11x + 3)e** 
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3. (b) D’y — Dy + 3y = 0 
4. (a) y= e~*(cy cos $x + Cc sin 4x) 
: a 3 
(b) y= 3 @ (cos $x — sin $x) 
5. (a) y = e*(c, cos $x + ce sin 4x) 
by = .s e'(cos $x — 4 sin 4x) 
6.) ¥ = oc; cos 5X + co sin $x 
(b) y= -< (cos $x — sin 4x) 
7. y = e**(c; cos x + Co Sin x) 
Section 2, page 629 
i (a) ae Oe we ee (bb). ge So 
4 x 
7 
x 2 —7/2 e 5 —z? 
eee dea eae (dd) y=5 + ce 
(e) y= te” —4 de oe (f) y= 3 ie ce” 
gQy-ite°" (hy) y= 44+ ce00r™ 
(i) y = 2(2cosx + sinx) + ce” 
er 1 Cc as b Cc Sey 
O) y= 948 ie icles (kk) y= 73% ++ re 
20 
oS —9a a 
d) y= 7 + ce (m) y x3 = x3 
sin x ze 
my y= x E 
2. (a) ya Prk 
a. Aa) ce ne 
Section 3, pages 638-640 
1. (a) y = cye” + coe — Ze (b) y = cye* + coe —** — 2xe-2* 
(Cc) y= ce? + ce + 2x +3 @) y =cicosx + cosinx + 4e* 
(ec) y = c cos x + ess x + x? —4) 
12. (a) y = cye™ + coxe* + c3e7?* (b) y = cye* + coe?* + c3e7?* 
Section 4, pages 647-648 
1. (a) y = cye’* + ce (b) y = (ax + ce" 
(c) y= ce * + ce (d) y = e*(c, cos 3x + co sin 3x) 
(e) y= cje— 8/2) x 4. Cc e(t/2)x (f) = c\e~%* + Co 
(g) y=e (c,cos V5 x + ¢, sin V5 x) 
th) y=(ox+ejet”* (i) y=e “(ce,cosx + csin x) 
types q 3 
i) yee c1c0s = x + ep sin ~ > x 
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5. @) y= ce* + exe + ce = (6). y = exe + (eax + ee™ 
(c) y = cye** + coe* + c3e°* 
6. y = C, COS 2x + Co sin 2x aa c3e3* 


ae FP = ce” + (Gx + c3)e° (bo) y = ce" + ce + Ge 
(c) y = (ex + e)e* + ce" + cre" 
(@) y= ci + coe” + ese * (e) y = cye* + (cox? + c3x + cae 
(crx + ae + c3 COs V3x +c. sin V3x 
C1 + coe® + c3e7?* 
(h) py = (eax + ca)e~* cos x + (c3x + cy)e~* sin x 
(i) y = e-*[(ce1x + cx)cos x + (c3x + casin x + ¢5] 
G) y=e-[(eix + co)cos x + (eax + ca)sin x] + C5x + C6 
(k) y = cye®® + coe—** + €3 cos 3x + c4 sin 3x 
(l) y = cye-* + c2cosx + cs sin x 


Section 5, pages 656-657 
1 


a) Yn = —23, 9 = re + ce — 9 : 
Meee 2x, y = c, COSX + cosinx + 2x 
eg = —3x + 5, y = ce + cer — 3x + 5 
ee = x — 2x + 2, y = cye® ce x" — 2x + 2 


(ec) yp = —ie*, y = cie®* + coe-* — Ze? 
ey = Qe", y = (ow + coe” + 2e* 
eg. ey He ae” ce — ge 


a 7, =sil x, y = ¢ cos2x Ah cy sin 2x ++ ‘sin: x 

(i) yp = sinx +x+2,y =e, cos 2x + c:sin2x + sinx + x +2 

G) yp = cosx + 2sinx, y = ec, + coe ** + cosx + 2sinx 

(k) yp = Fe Be il lla fae 

(1) yp = —#cos 2x + sin 2x, 
y= ae ft V3) a i a 1—V3)z 


mem 3 COS x + 2x 


— 2cos 2x + sin 2x 


y = 3(e* — e* sin x) 
i = 3e~-*(cos 2t + sin 2¢ — 1) 


Bw oN 


(a) y= (cx + oe + x7 + 2x +4 
ny = ce"? + eye 2 Qn g 
(c) y = cye~** + coe* — Ze 
(d) y= cy + coe” — x® — 2x? + x 
(e) y = cye* + coe?” — 2xe7** 
(f) y = cie* + coe—2* — 2xe* —x +3 
(g) y = cye* + coe? — 2xe7** = Sxe* 
(h) y = cre~* + co + c3x — ex? + 7x? 
(i) y = c,cos2x + cosin2x — cos 3x 
(j) y = c1cos3x + c2sin3x — 3x cos 3x 
(k) y = cicosx + cesinx — 5xcosx + $e 


(l) y=crcosx + cesinx — 2x cosx — 4x sinx 
(m) y = (cyx + co)e* — 3e* sin x 
(n) y = e-*(c, cos x + cy sin x) + Ze*(cos x + sin x) 
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Section 6, pages 662-663 


1. (a) 5sinh5x (b) 12 cosh 3x sinh 3x 
x2 1. 
(c) 2x tanh(x” + 1) (d) xo | 
V/ x2 + ] 
(e) sech?x (f) —sech x tanh x 
(g) —cschxcothx (h) —csch?x 
. =Desece ti 7) ee 
(i) eee eo + pp (j) sinh (=) 
2. (a) +cosh7x +c (bys 2 sinh 5 +e 
(c) 5 cosh43x + ¢ (d) Incoshx +c 
(e) In |tanhx| +c (f) $cosh(2x? + 1)+c 
(g) 5 tanh®2x + ¢ (h) 4(nsinh x)? + c 
(i) fsinh 2x +5 +c (j) 4sinh 2x —5 + 


5. (a) y = cy cosh2x + co sinh 2x 


(b) y = c,cosh V7 x + co sinh V7 x 
(c) y = c, cosh 3x + ce sinh 3x — e?* 

1 ; 
(d) y = c, cosh kx + co sinh kx — = o <a sin x 


(e) y = c, cosh 4x + co sinh 4x + 3% sinh 8x 
(f) y = ce, cosh 4x + ec: sinh 4x + 2x sinh 4x 


9. (b) ; 


10. 3 sinh 2 
ee 1 


ee vite 


Index 


Abscissa, 10 
Absolute maximum point, 87 
Absolute minimum point, 87 
Absolute value, 6 

of complex number, 332 
Acceleration, 105 

integral of, 230 

vector, 585 
Addition of vectors, 563 
Algebraic function, 324 
Alternating series, 498 
Angle of complex number, 339 
Angle of inclination, 302 
Annihilator, 651 
Antiderivative, 203 
Approximate values, 109 
Arc length, 552 

integral formula, 553, 556 

polar coordinates, 610 
Arccosecant function, 320 
Arccosine function, 315 
Arccotangent function, 318 
Archimedean spiral, 597 
Arcsecant function, 319 
Arcsine function, 311 
Arctangent function, 316 
Area, 171 

between curves, 219 


integral formulas, 172, 195, 221 


polar coordinates, 604 


Argument of complex number, 339 


Associative laws 
complex numbers, 328 
real numbers, 2 
vectors, 566 

Asymptotes, 154 


Average value of a function, 405 


Average velocity, 104 
Axis 
conjugate (hyperbola), 156 


of parabola, 138 
x9 
Ys 9 ed 


Bound 
least upper, 7 
lower, 4, 164 
upper, 4, 164 
Bounded 
function on an interval, 164 
interval, 4 
set, 4, 164 
subset of 3-dimensional space, 433 


Cardioid, 601 
Catenary, 660 
Center 

of a circle, 132 

of an ellipse, 145 
Chain Rule, 69 
Change of Variable Theorem for Definite 

Integrals, 215 
Characteristic equation, 346, 633 
Characteristic polynomial, 633 
Characteristic properties of definite 
integral, 447 

Cirele, 12 

center and radius, 132 

equation of, 12, 132 
Common logarithm, 268 
Commutative laws, 2, 328, 566 
Comparison Test 

for integrals, 469 

for series, 494 
Complex exponential function, 336 
Complex numbers, 327 

absolute value, 332 

angle, 339 

argument, 339 

conjugate, 333 
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exponential form of, 338 third, 65 

imaginary part, 331 Derived vector, 570 

modulus, 332 Difference of two sets, 164 

nth root, 341 Differentiable function, 48, 543 

real part, 331 Differential, The, 116 
Composition of functions, 19 Differential equations, 272, 344, 617 
Composition of linear operators, 631 associated homogeneous, 640° 
Concave characteristic equation of, 346, 633 


downward, 86, 92 

upward, 86, 92 
Confocal hyperbolas, 161 
Conic sections, 132 
Conjugate axis (hyperbola), 156 
Conjugate hyperbolas, 155 
Constant function, 24 
Continuous function, 33, 542 
Convergent 


absolutely convergent series, 502 
conditionally convergent series, 502 


eventually, 496 

improper integral, 461, 463 
infinite sequence, 474 
infinite series, 483 


interval of convergence, 512, 516 


radius of convergence, 513, 516 
Coordinate functions, 541, 569 
Coordinates 

polar, 590 

of a vector, 561 

#10 

y, 10 
Cosecant function, 301 

hyperbolic cosecant, 660 

integral, 374 
Cosine function, 282 

derivative, 294 

hyperbolic cosine, 658 

integral, 295 
Cotangent function, 301 

hyperbolic cotangent, 660 
Critical point, 88 
Cross-sectional area function, 433 
Curtate cycloid, 549 
Curve 

parametrized, 542 

rectifiable, 552 

slope, 55, 546 
Curve sketching, 84 
Cycloid, 548 
Cylindrical shells, 439 


Decreasing function, 185 
Decreasing sequence, 478 
Definite integral, 169 
properties, 191, 668 
Degree, 288 
De Moivre’s formula, 344 
Dependent variable, 22 
Derivative, 48, 637 
second, 65 


characteristic polynomial of, 633 
constant coefficients, 345, 617 
first order, 272 
first order linear, 276, 623 
general solution, 273, 275 
homogeneous, 345, 617, 640 
integrating factor, 626 
linear, 276, 344, 617 
nonhomogeneous, 648 
nth order, 617 
second-order, 344, 617 
separable, 274, 614 
solution, 272, 617 

Differential operator, 618 
annihilator, 651 
linear, 632 

Differentiation 
arccosecant function, 321 
arccosine function, 316 
arccotangent function, 318 
arcsecant function, 320 
arcsine function, 313 
arctangent function, 317 
constant, 58 
constant times a function, 58 
cosecant function, 304 
cosine function, 294 
cotangent function, 304 
exponential functions, 252, 263 
implicit differentiation, 79 
logarithm functions, 243, 269 
logarithmic differentiation, 267 
product, 59 
quotient, 62 
secant function, 304 
sine function, 294 
sum, 57 
tangent function, 304 
x", 61, 64; Ti; 267 

Direction, vector, 568 

Directrix 
ellipse, 148 
hyperbola, 157 
parabola, 136 

Discontinuity, 33 
removable, 38 

Discontinuous functions, 38 
integration of, 452 

Disjoint sets, 164 

Distance 


along a curve (arc length), 281, 552 


INDEX 


straight-line between two points, 11 
traveled along a curve, 581 
traveled along a straight line, 232 
Distributive laws, 2, 328, 566 
Divergent 
improper integral, 461, 463 
infinite series, 483 
Division, existence of, 2, 328 
Domain of a function, 14 
Dummy variable, 171 


e, 245, 247 
Eccentricity 
ellipse, 148 
hyperbola, 157 
Ellipse, 143 
directrix, 148 
eccentricity, 148 
foci, 143 
major and minor axes, 144 
Empty set, 164 
Endpoints of an interval, 5 
Eguilateral hyperbolas, 159 
Equivalent parametrizations, 559 
Error 
alternating infinite series, 500 
Midpoint Rule, 426 
Remainder in Taylor Series, 533 
Simpson’s Rule, 430 
Trapezoid Rule, 419 
Even function, 90 
Exponential functions 
a, 202 
exp(x), 250 
e253 
e*, 337 
e7, 336 
Extreme point, 87 
Extreme value, 88 


Factorial, 507 

Finite sequence, 175 

Focus 
ellipse, 143 
hyperbola, 152 
parabola, 136 

Function, 13. - 
algebraic, 324 
average value, 405 
complex exponential, 336 
composition, 19 
constant, 24 
continuous, 33 
continuously differentiable, 559 
coordinate, 541, 569 
cosecant, 301 
cosine, 282 
cotangent, 301 
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decreasing, 185 
definite integral of, 169 
derivative of, 48 
differentiable, 48, 543 
discontinuous, 38 
domain of, 14 
even, 90 
exponential, 250, 253, 262, 336; 337 
graph of, 16 
hyperbolic, 658, 660 
identity, 22 
implicitly defined, 77, 79 
increasing, 185 
indefinite integral of, 209 
integrable, 168 
inverse, 249 
linear, 45 
logarithm, 238, 268 
mean of, 405 
monotonic, 185 
odd, 92 
of a real variable, 17 
periodic, 284 
piecewise continuous, 460 
polynomial, 34, 323 
range of, 14 
rational, 34 
secant, 301 
sine, 282 
step, 455 
strictly decreasing, 190 
strictly increasing, 190 
tangent, 301 
transcendental, 324 
trigonometric, 281 
value of, 14 

Fundamental Theorem of Calculus, 200 
Corollary, 204 


Generalized Mean Value Theorem, 122 
Geometric series (infinite), 487 
Graph 

of equation, 12 

of function, 16 

polar graph (in polar coordinates), 594 
Greatest Lower Bound, 9 (exercises) 


= 


Harmonic series, 484 
alternating, 498 
Homogeneous differential equations, 345, 
617, 640 
Hooke’s Law, 447 
Horizontal line, 41 
Hyperbola, 151 
asymptotes, 154 
confocal, 161 
conjugate, 155 
conjugate axis, 156 
directrix, 157 
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eccentricity, 157 

equilateral, 159 

foes, 152 

transverse axis, 153 

vertices, 153 
Hyperbolic functions, 657 

cosecant, 660 

cosine, 658 

cotangent, 660 

secant, 660 

sine, 658 

tangent, 660 
Hyperbolic spiral, 601 


Identities, existence of, 2, 328, 566 
Identity function, 22 
Imaginary number, 331 
Implicit 
differentiation, 79 


function defined implicitly, 77, 79 


Improper integral, 461, 463 
Increasing function, 185 
Increasing sequence, 478 
Indefinite integral, 209 
Independent variable, 22 
Indeterminate forms, 271, 310 
Inequalities, 3, 4, 664 
Infinite sequence, 175, 474 
Infinite series, 482 
absolutely convergent, 502 
alternating, 498 
alternating harmonic, 498 
Comparison Test for, 494 
conditionally convergent, 502 
convergence of, 483 
divergence of, 483 
geometric, 487 
harmonic, 484 
Integral Test for, 491 
linearity of, 485 
Maclaurin, 537 
nonnegative, 490 
p-series, 493 
power series, 511 
power series in x—a, 516 
Ratio Test for, 505 
Taylor, 528 
telescoping, 489 
Inflection, point of, 88 
Initial point of a vector, 560 
Integer, 1 
Integrability 
of a continuous function, 199 
of discontinuous functions, 452 
of a monotonic function, 185 
Integrable function, 168 
Integral 
of acceleration, 230 


INDEX 


Comparison Test for Integrals, 469 
convergent improper, 461, 463 
cosecant function, 374 
cosine function, 295 
definite, 169 
divergent improper, 461, 463 
equivalent definitions of, 672 
exponential functions, 254, 265 
form of Mean Value Theorem, 410 
improper, 461, 463 
indefinite, 209 
limit of Riemann sums, 414 
logarithm function, 355 
proper, 461 
properties of, 191, 668 
secant function, 369 
sine function, 295 
Test for infinite series, 491 
of velocity, 229 
Integrand, 190 
Integrating factor, 626 
Integration (See a/so Integral) 
by partial fractions, 384 
by parts, 354 
by trigonometric substitutions, 374 
limits of, 190 
of discontinuous functions, 452 
of trigonometric functions (techniques), 
362 
recursion formulas, 359, 361, 362, 368, 
369, 370 
Intercept (x- and y-), 46 
Interior of a set, 512 
Interior point, 512 
Intermediate Value Theorem, 258 
Intersection of two sets, 164 
Interval, 4 
bounded, 4 
closed, 5 
of convergence, 512, 516 
endpoints of, 5 
open, 5 
Inverse function, 249 
differentiability of, 261 
Inverse trigonometric functions, 311 
Irrational number, 9 (exercises) 
Irreducible polynomial, 381, 386 


Latus rectum 

ellipse, 150 

hyperbola, 161 
Least Upper Bound, 7 
Leibnitz’s Rule, 60 
Lemniscate, 598, 601 
Length 

arc length of curve, 552 

of vector, 561 
L’HO6pital’s Rules, 122-127 


INDEX 


Limacon, 601 
Limit 
or F(x), 31, 37 
indeterminate forms, 271, 310 
properties of limits, 32, 664 
of sequence, 177, 474 
Lines 
horizontal, 41 
parallel, 43 
straight, 39 
tangent, 48, 573 
vertical, 41 
Linear differential equation, 276, 344, 617 
first order, 276, 623 
Linear differential operator, 618, 632 
Linear function, 45 
Linear operator, 630 
composition of, 631 
product of, 631 
sum of, 630 
Local maximum point, 87 
Local minimum point, 87 
Logarithm 
common, 268 
natural, 238, 241 
to the base a, 268 
Logarithmic differentiation, 267 
Lower bound, 4, 164 
Lower integral, 174 
Lower sum, 165 


Maclaurin series, 537 
Major axis, ellipse, 144 
Maximum and minimum problems, 94, 
101 
Maximum point 
absolute, 87 
local, 87 
Mean of a function, 405 
Mean Value Theorem, 113 
generalized, 122 
integral form of, 410 
Mesh of a partition, 413 
Midpoint Rule, 423 
Minimum point 
absolute, 87 
local, 87 
Minor axis, ellipse, 144 
Modulus of complex number, 332 
Monotonic function, 185 
Monotonic sequence, 478 


Natural logarithm, 238, 241 
integral, 355 

Negative number, 3 

Newton’s law of gravitation, 449 

Nonhomogeneous equations, 648 

Nonnegative series, 490 
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nth order differential equation, 617 
Number 
complex, 327 
imaginary, 331 
irrational, 9 (exercises) 
negative, 3 
positive, 3 
rational, 1 
real, 1 
Numerical methods of integration 
Midpoint Rule, 423 
Tangent Formula, 424 
Trapezoid Rule, 417 
Simpson’s Rule, 429 


Odd function, 92 
Open interval, 5 
Ordered pair, 9 
Ordinate, 10 


p-series, 493 
Parabola, 136 
axis, 138 
directrix, 136 
focus, 136 
vertex, 137 
Parallel lines, 43 
Parallel translation, 584 
Parametrizations 
of a curve, 542 
continuously differentiable, 559 
equivalent, 559 
Parametrized curve, 542 
Partial fractions decomposition, 387, 392 
Partition, 164 
Period of a function, 284 
Periodic functions, 284 
Piecewise continuous function, 460 
Point of inflection, 88 
Polar coordinates, 590 
arc length, 610 
area, 604 
Polar graph, 594 
Polynomial, 34, 323 
irreducible, 381, 386 
of two variables, 323 
Positive number, 3 
Power series, 511 
function defined by, 519 
in x—a, 516 
interval of convergence, 512, 516 
radius of convergence, 513, 516 
Product 
of complex numbers, 327 
of functions, 20 
of linear operators, 631 
of real numbers, 2 
scalar, number with vector, 564 
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Prolate cycloid, 549 
Proper integral, 461 
Pure imaginary number, 331 


Radian, 288 
Radium, half-life of, 277 
Radius 
of circle, 132 
of convergence of power series, 513, 516 
Range of a function, 14 
Rate of change, 54, 103 
Ratio Test for infinite series, 505 
Rational function, 34 
Rational number, 1 
Real number, 1 
Real number line, 4 
Real-valued function, 17 
Rectifiable curve, 552 
Recursion formulas, 359, 361, 362, 368, 
369, 370 
Related rates, 106 
Remainder, of Taylor’s Theorem, 533, 539 
Removable discontinuity, 38 
Riemann, Bernhard, 413 
Riemann sum, 413 
Rolle’s Theorem, 111 
Roots of complex numbers, 341 


Scalar, 564 
Secant function, 301 
hyperbolic secant, 660 
integral, 369 
Second derivative, 65 
Second-order linear differential equation, 
344, 617 
Separable differential equation, 274, 614 
Sequence, 175 
bounded above, 479 
bounded below, 479 
convergence of, 474 
decreasing, 478 
finite, 175 
increasing, 478 
infinite, 175, 474 
limit of, 175, 474 
monotonic, 478 
of partial sums, 482 
Series, 177 (See also Infinite series) 
Set 
bounded, 164 
difference of, 164 
empty, 164 
interior of, 512 
intersection of, 164 
union of, 163 
Simpson’s Rule, 429 
Sine function, 282 
derivative, 294 


INDEX 


hyperbolic sine, 658 
integral, 295 
Slope 
of curve, 55 
of graph, 54 
of line segment, 41 
of non-vertical line, 42 
of parallel lines, 43 
of parametrized curve, 546 
of perpendicular lines, 44 
of vertical line, 42 
Solid, 433 
integral formulas for volume, 434, 437, 
440 
of revolution, 437 
Speed, 104 
vector, 578 
Spiral 
Archimedean, 597 
hyperbolic, 601 
Step function, 455 
Straight line, 39 
Strictly decreasing function, 190 
Strictly increasing function, 190 
Subtraction, existence of, 2, 328, 566 
Sum 
of complex numbers, 327 
of functions, 20 
of linear operators, 630 
of real numbers, 2 
of vectors, 563 
Summation, 177 
identitities, 179 
index, 177 


Tangent 
Formula, 424 
function (trigonometric), 301 
hyperbolic tangent, 660 


line, 48, 573 
vector, 571 
Taylor 


approximation, 529 
Formula with Remainder, 533 
series, 528 
Theorem, 531, alternative form, 533 
Telescoping series, 489 
Terminal point of a vector, 560 
Third derivative, 65 
Transcendental function, 324 
Transitive law, 3 
Translation, parallel, 584 
Transverse axis of hyperbola, 153 
Trapezoid Rule, 417 
Triangle inequality, 664 
Trichotomy, law of, 3 
Trigonometric identities, 283, 284, 286, 
287, 302 


INDEX 


Trigonometric substitutions, 374 
Trisectrix, 80 


Union of two sets, 163 
Upper bound, 4, 164 
Upper integral, 174 
Upper sum, 165 


Value of a function, 14 

Variable 
dependent, 22 
dummy, 171 
independent, 22 

Vector, 560 
acceleration, 585 
addition, 563 
coordinates, 561 
derived, 570 
direction of, 568 
initial point, 560 
length, 561 
product (scalar), 564 
properties of, 566 
scalar product, 564 
speed, 578 
subtraction, 563 
tangent, 571 
terminal point, 560 
velocity, 578 
zero, 563 
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Vector space, 567 
Velocity, 104 
average, 104 
integral of, 229 
vector, 578 
Vertex, parabola, 137 
Vertical line, 41 
Vertices, hyperbola, 153 
Volume of solids, 432 
cylindrical shells, 439 
integral formula, 434 
integral formula for solids of revolution, 
437 


Work 
done against a force, 444, 448 
done by a force, 444, 445 


x-axis, 9 - 
x-coordinate, 10 
x-intercept, 46 


y-axis, 9 
y-coordinate, 10 
y-intercept, 46 


Zero vector, 563 


